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UNIVERSITY OF LO NDON, UNIV ERSITY COLLEGE 

DEPARTMENT OF APPLIED STATISTICS 
Tha Biometric Laboratory 

(assisted by a grant from the Worshipful Company of Drapers) 

Until the phenomena of any branch of knowledge have been submitted to measurement 
and number it cannot assume the status and dignity of a science. FRANCIS GALTON. 

Under the direction of Professor Karl Pearson, F.R.S. Senior Lecturer s E. S. 
Pearson, D.Sc. Lecturer: G. M. Mohant, D.Sc. Assistants: E. C, Fieller, B.A., 
S. J. Pretori us, M.Sc. and Ida McLearn. Hon. Research Assistant: Margaret 
V. Pearson, M.A. Crewdson-Benington Student: Marjorie F. Hoadley, JLA. 

This laboratory provides a complete training in modern statistical methods and 
i$ especiayy arranged so as to assist research workers engaged on biometric problems. 

The Francis Galton Eugenics Laboratory 

National Eugenics is the study of agencies under social control\ that may improve 
or impair the racial qualities of future generations , either physically or mentally . . 

The Laboratory was founded by Sir Francis Galton and is under the supervision 
of Professor Karl Pearson, F.R.S. Assistant Professor: Ethel M. Elderton. 
Reader in Medical Statistics : Percy Stocks, M.A., M.D. Hon. Galton Research 
Fellow: Julia Bell, M.A., M.R.C.S., M.R.C.P. Assistants : M. Noel Karn, M.A., 
JOYCE TOWNEND. Secretary: M. MOUL, M.A. * 

It was the intention of the Founder, that the Laboratory should serve (i)as a 
storehouse of statistical material bearing on tjje mental and physical conditions in 
man, and the relation of these conditions to inheritance and environment; (ii) as 
a centre for the publication or other form of distribution of information concerning 
National Eugenics; (Hi) as a school for training and assisting research-workers in 
special problems in Eugenics. 

CAMBRIDGE UNIVERSITY"PRESSTRACTS FOR COMPUTERS 

I. Tables of the Digamma and Trigamma Functions. By Eleanor Pair- 
man, M.A. Price 3 s. net. 

°° j 

Tables for summing vS » 2 . - ——— —--. where the p y s and 0’s 

*+9i)(p*t + q*) ...{p n t+q n ) r y 

are numerical factors. 

II. On the Construction of Tables and on Interpolation. Part I. Univariate 
Tables. By Karl Pearson, F.R.S. Price 3 s. 9 d. net. 

III. On the Construction of Tables and on Interpolation. Part II. Bivariate 
Tables. By Karl Pearson, F.R.S. Price 3j. 9 d. net. 

IV. Tables of the Logarithms ofvthe Complete P-function to Twelve Figures. 
Reprint with Portrait of A. M. LEGENDRE. Price 3s.gd. net. 

V. Table of Coefficients of Everett’s Central-Difference Interpolation For¬ 
mula. By A. J. Thompson, Ph.D. Price 3s. gd. net. 

VI. Smoothing. By E. C. Rhodes, M.Sc. Price 3s. gd. net. 

VI!* The numerical Evaluations of the Incomplete B-Function or of the Integral 

I ~xf- x dx for Ranges of x between o and 1. By H. E. Soper, M.A. 

J 0 . Price 3 s. gd. net. 

VIII. Table of the Logarithms of the Complete T -Function (to ten decimal 
places) for Argument 2 to 1200 beyond Legendre’s Range (Argument 1 to 2). 
By Egon S. Pearson, Q.Sc. Price 3s. gd. net. 

IX. Log P (x) from x *= 1 to 50 9 by intervals of O’OI. By JOHN BROWNLEE, M.D., 
D.Sc. Price 3 s. gd. net . 

X. On Quadrature and Cubature or on Methods of Determining Approximately 
Single and Double Integrals. By J. O. Irwin, M.Sc. Price 7s. 6d. net. 

XI. Logarithmetica Britannica. A Standard Table ofJLogarithms to Twenty 
Decimal Places. Part IX. Logarithms of Numberlr 90,000 to 100,000. By 
A. J. Thompson, Ph.D. Price 15s. net. 

XII. Tables of the Probable Error of the Coefficient of Correlation. By Karl 

Holzinger; Ph.D. Price 3s . gd. net. 

XIII. Bibliotheca Tabularum Mathematicarum, being a Descriptive Catalogue of 
Mathematical Tables. Part I. A, Logarithms of Numbers. By James Hender¬ 
son,.Ph.D. Pricegs.net. 

XIV. Logarithmetica Britannica. A Standard Table of Logarithms to Twenty 
Decimal Places. Part VIIL ' Logarithms of Numbers 80,000 to 90,000. By 
A. J. Thompson, Pii.D. Price 15 net. 

XV. Random Sampling Numbers. By L. H. C. Tippett, M.SC.; with a Foreword 
by Karl Pearson. Price 3s. gd. net. * 

XVI. Logarithmetica Britannica. A Standard Table of Logarithms to Twenty 
Decimal Places. Part IV. Logarithms of Numbers 40,000 to 50,000. By 
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uamortag* u mmrmty jrrmm, jz : mar JLame, juon&m, js.v. 4, ana their-Agents, 
an now (he sole agents for the sale of the following publications of the 
Gotten and Biometric laboratories. University of London: 

Biometric Laboratory Publications 
Drapers* Company Research Memoirs. 


Biometric 

"Mathematical Contributions to the 

Theory of Evolution.—XIII. On the Theory 
of Contingency and its Relation to Associa¬ 
tion and Normal Correlation. By Kart. 
Pearson, F.R.S. Price 10a net. 
"Mathematical Contributions to the 
Theory of Evolution.—XIV. On the Theory 
of Skew Correlation and Non-linear Regres¬ 
sion. By Karl Pearson. Price 10*. net. 

Mathematical Contributions to the 
Theory of Evolution.—XV. On the Mathe¬ 
matical Theory of Random Migration. By 
Karl Pearson, F.R.S., with the assistance 
of John Blakeman, M.Sc. Price 5*. net. 

Mathematical Contributions to the 
Theory of Evolution.—XVI. On Further 
Methods of Measuring Correlation. By 
Karl Pearson, F.R.S. Price 5*. net. 

Mathematical Contributions to the 
Theory of Evolution.—XVII. On Homo- 
typosis in the Animal Kingdom. A Co- 
oik? rati ve Study. [In preparation 

Albinism in Man. By Karl Pearson, 

E. Nettleship, and C. H. Usher. Text, 
Part 1, and Atlas, Part I. Price 35*. net. 

* Sold only with 


Series . 

VTI. Mathematical Contributions to the 

Theory of Evolution.—XVIII. On a Novel 
Method of Regarding the Association of 
two Variates classed in Alternative Cate¬ 
gories. By Karl Pearson. Price 5s. net. 
VHL Albinism in Man. By Karl Pearson, 
E. Nettleship, and C. H. Usher. Text, 
Part II, and Atlas, Part II. Price 30*. net. 

IX. Albinism in Man. By Karl Prarbon, 

E. Nettleship, and C. H. Usher. Text, 
Part IV, and Atlas, Part IV. Price 21 *. net. 

X. A Monograph on the Long Bones 

of the English Skeleton. By Karl 
Pearson, F.R.S., and Julia Bell, M.A. 
Part I. The Femur [of Man], Text I and 
Atlas of Plates I. Pi-ice 30*. net. 

XI. A Monograph on the Long Bones 

of the English Skeleton. By Karl 
Pearson, F.R.S., and Julia Bell, M.A. 
Pari I, Soction II. The Femur [with Special 
Reference to other Primate Femora']. Text 11 
and Atlas of Plates II. Price 40*. net. 

XII. On the Sesamoids of the Knee-Joint. 

By Karl Pearson, F.R.S., and Adelaide 
G. Davin, B.Sc. (Reprinted from Bio - 
metrilra. ) Text and 37 Plates. Price 30*. net. 
complete sets. 


Studies in National Deterioration. 


On the Relation of Fertility in Man 

to Social Status, and on the changes in this 
Relation that have taken place m the last 
50 years. By David Heron, M.A., D.Sc. 

[Out of Print 

A First Study of the Statistics of 

Pulmonary Tuberculosis (Inheritance). By 
Karl Pearson, F.R.S. [Out of Print 

A Second Study of the Statistics of 
Pulmonary Tuberculosis. Marital Infection. 
By Ernest G. Pope and Karl Pearson. 
With an Appendix on Assortative Mating 
by Ethel M. Elderton. Price 5*. net. 

On the Relationship of Health to the 
Psychical and Physical Characters in School 
Children. By Karl Pearson, F.R.S. Price 
15*. net. 

On the Inheritance of the Diathesis 

of Phthisis and Insanity. A Statistical 
Study based upon the Family History of 
1,500 Criminals. By Charles Goring, 
M.D., B.Sc. Price 4s. net. 

A Third Study of the Statistics of 
Pulmonary Tuberculosis. The Mortality 
of the Tuberculous and Sanatorium Treat¬ 


ment. By W. P. Elderton, F.I.A., and 
S. J. Perry, A.I.A. Price 4*. net. 

VII. On the Intensity of Natural Selection 
in Man. (On the Relation of Darwinism 
to the Infantile Death-rate.) By E. C. 
Snow, D.Sc. Price 4s. net. 

VHI. A Fourth Study of the Statistics of 
Pulmonary Tuberculosis : the Mortality of 
the Tulmrculous: Sanatorium and Tuber¬ 
culin Treatment. By W. Palin Elderton, 
F.I.A., and Sidney J. Perry, A.I.A. 
Price 4*. net. 

IX. A Statistical Study of Oral Tern- 

]>eratures in School Children with special 
reference to Parental, Environmental and 
Class Differences. By M. H. Williams, 
M.B., Julia Bell, M.A., and Karl 
Pearson, F.R.S. Price 6*. net. 

X. Study of the Data provided by a 

Baby-Clinic in a large Manufacturing Town. 
By Mary Noel Karn and Karl Pearbon. 
Price 15*. net. 

XI. Blood Pressure in Early Life. A 

Statistical Study. By Percy Stocks, M.D., 
D.P.H., assisted by M. Noel Karn. Price 
12*. net. 


Technical 

I. On a Theory of the Stresses in Crane 

and Coupling Hooks with Experimental 
Comparison with Existing Theory. By 

E. S. Andrews, B.Sc. Eng., assisted by 
Karl Pearson, F.E.S. Issued. Price 4*. net. 

IL On some Disregarded Points in the 
Stability of Masonry Dams. By L. W. 
Atcherley, assisted by Karl Pearson, 

F. R.S. Issued. Price 7s. net. Sold only 
with complete sets. 

HI. On the Graphics of Metal Arches 

with special reference to the Relative 
Strength of Two-pivoted, Three-pivoted 
and Built-in Metal Arches. By L. W. 
Atcherley and Karl Pearson, F.R.S. 
Issued. Price 5s. net. 

IV. On Torsional Vibrations in Axles 

and Shafting. By Karl Pearson, F.R.S. 
Issued . Price 6*. net . 


Series . 

V. An Experimental Study of the 

Stresses in Masonry Dams. By Karl 
Pearson, F.R.S. and A. F. Campbell 
Pollard, assisted by C. W. Wheen, B.Sc. 
Eng., and L. F. Richardson, B.A. Issued. 
Price 7s. net . 

VI. On a Practical Theory of HlHptic and 

Pseudo-elliptic Arohes, with special refer¬ 
ence to the ideal Masonry Aren. By Karl 
Pearson, F.R.S., W. D. Reynolds, B.Sc, 
Eng., and W. F. Stanton, B.Sc. Eng. 
Issued. Price 6s . net. 

VH. On the Torsion resulting from Flexure 

in Prisms with Cross-sections of Uni-axial 
Symmetry only. By A. W. Young, Ethel 
M. Elderton and Karl Pearson, F.R.S. 
Issued . Price 7s . 6d . net. 



Drapers* Company Research Memoirs—( cont .). 

Questions of the Day and of the Fray. 

VII. Mendelism and the Problem of Mental 


I. The Influence of Parental Alcoholism 

on the Physique and Ability of the Off¬ 
spring. A Reply to the Cambridge Econo¬ 
mists. By Karl Pearson. Price 1*. 6 d. net. 

II. Mental Defect, Mal-Nutrition, and 

tho Teacher’s Appreciation of Intelligence. 

A Reply to Criticisms of tho Memoir on 
‘ The Influence of Defective Physique and 
Unfavourable Home Environment on the 
Intelligence of School Children.’ By David 
Heron, D.Sc. Price 1#. 6 d. net. 

III. An Attempt to correct the Misstate¬ 

ments made by Sir Victor Horsley, F.R.S., 
F.R.C.S., and Mary D. St urge, M.D., in 
their Criticisms of the Memoir : ‘ A First 
Study of the Influence of Parental Alcohol- -- 
isni,\Stc. By Karl Pearson. Pr%eelt.Gd.net. 1 

IV. The Fight against Tuberculosis and 1 

the Death-rate from Phthisis. By Karl 
Pearson, F.R.S. [Out of print 

V. Social Problems: Their Treatment, | xi. 

Past, Present and Future. By Karl 
Pearson, F.R.S. Price Is.6d. net. 

VI. Hugenics andPublic Health. Lecture to 

the York Congress of the Royal Sanitary In¬ 
stitute. By Karl Pearson. Price\s.6d. net. { 


Defect. I. A Criticism of Recent American 
Work. By David Heron, D.Sc. (Double 
Number*) Price 2*. net. 

VIII. Mendelism and the Problem of Mental 

Defect. II. The Continuity of Mental 
Defect. By Karl Pearson, F.R.S., and 
Gostav A. Jaederholm, Ph.D. Price 
Is. 6c?. net. 

IX. Mendelism and the Problem of Mental 

Defect. III. On the Graduated Character 
of Mental Defect, and on the need for 
standardizing J udgraents as to the Grade 
of Social Inefficiency which shall involve 
Segregation. By Karl Pearson, F.R.S. 
(Double Number.) Price 2*. net. 

The Science of Man. Its Needs and its 
Prospects. By Karl Pearson, F.R.S. Being 
the Presidential Address to Section H of 
the British Association, 1920. Price la. 6d. 
net. 

Francis Galton, A Centenary Appre¬ 
ciation. With portrait sketch. By Karl 
Pearson, F.R.S. Price 2*. net. 

XII. Charles Darwin, 1800-1882. An Ap¬ 
preciation. With ail unpublished portrait. 
By Karl Pearson, F.R.S. Price 2s. 6 d. net. 


Eugenics Laboratory Publications 


I. 


II. 


III. 


IV. 


V. 


VI. 


VII. 


VIII. 


IX. 


X. 


XI. 


XII. 


MEMOIR SERIES. 


The Inheritance of Ability. By Edgar 
Schuster, D.Sc., Formerly Galton Research 
Fellow, and E. M. Elderton. [(hit of Print 

A First Study of the Statistics of 
Insanity and the Inheritance of tho Insane 
Diathesis. By David Heron, D.Sc., Form¬ 
erly Galton Research Fellow. [Out of Print 

The Promise of Youth and the 
Performance of Manhood. By Edgar 
Schuster, D.Sc. [Out of Print 

On the Measure of the Resemblance 
of First Cousins. By Ethel M. Eldekton, 
Galton Research Fellow, assisted by Karl 
Pearson, F.R.S. Price 5s. net. 

A First Study of the Inheritance of 
Vision and of the Relative Influence of 
Heredity and Environment on Sight. By 
Amy Barrington and Karl Pearson, 
F.R.S. Price 5*. net. 

Treasury of Human Inheritance. 

Farts 1 and II (double part). (Diabetes in¬ 
sipidus, Split-Foot, Poly dactyl ism, Bracliy- 
daetylism, Tuberculosis, Deaf-Mutism, and 
Legal Ability.) Price 14*. 'net. 

On the Relationship of Condition of 
the Teeth in Children to Factors of Health 
and Home Environment. By E. C. Rhodes, 
D.Sc. Price 9*. net. 

The Influence of Unfavourable Home 

Environment and Defective Physique on 
the Intelligence of School Children. By 
David Heron, M.A., D.Sc. [Out of print 

The Treasury ofHuman Inheritance. 
Part III (Angioneurotic Oedema, Herma¬ 
phroditism, Deaf-Mutism, Insanity, Com¬ 
mercial Ability.) Price 10*. net. 

The Influence of Parental Alcoholism 
on the Physique and Intelligence of the 
Offspring. By Ethel M. Elderton, as¬ 
sisted by Karl Pearson. Price 6*. net. 
The Treasury ofHuman Inheritance. 
Part IV. (Cleft Palate, Hare-Lip, Deaf- 
Mutism, and Congenital Cataract.) Price 
10*. net. 

The Treasury ofHuman Inheritance. 

PartsVandVI. (Haemophilia.) Pncel5«.ne*. 


XIII. A Second Study of the Influence of 

Parental Alcoholism on the Physique and 
Intelligence of the Offspring. By Karl 
Pearson, F.R.S., and Ethel M. Elderton. 
Price 6*. net. 

XIV. A Preliminary Study of Extreme 

Alcoholism in Adults. By Amy Barring¬ 
ton and Karl Pearson, F.R.S., assisted 
by David Heron, D.Sc. Price 6*. net. 

XV. The Treasury of Human Inheritance. 

Dwarfism, with 49 Plates of Illustrations 
and 8 Plates of Pedigrees. Price 16*. net. 

XVI. The Treasury of Human Inheritance. 

Prefatory matter and indices to Vol. 1. 
With Frontispiece Portraits of Sir Francis 
Galton and Ancestry. Price 5*. net. 

XVII. A Second Study of Elxtreme Alco¬ 
holism in Adults. With special reference 
to the Horne-Office Inebriate Reformatory 
data. By David Heron, D.Sc. Price 7s. (Id. 
net. 

XVIII. On the Correlation of Fertility with 

Social Value. A Cooperative Study. 
Price 7s. 6d. net. 

XIX—XX. Report on the English Birthrate. 

Part I. England, North of the Humber. 
By Ethel M. Elderton, Galton Research 
Follow. Price 9*. net. 

XXI. The Treasury of Human Inheritance. 

Vol. II (Nettleship Memorial Volume) 
Heredity of Anomalies and Diseases of the 
Eye. Part I. Price 46*. net. 

XXII. TheTreasury of Human Inheritance. 

Vol. III. Parti. Hereditary Disorders of 
Bone Development. Price 46*. net. 

XXHI. The Treasury ofHuman Inheritance. 

Vol. II (Nettleship Memorial Volume). 
Part II. Colour Blindness. Price 46*. net, 

XXIV. The Treasury of Human Inheritance. 

Vol. II (Nettleship Memorial Volume). 
Part III. Blue Sclerotics and Fragility of 
Bone. Price 36*. net. 

XXV. On the Relative Value of the Factors 

which influence Infant Welfare. By Ethel 
M. Elderton. (Reprinted from Annals of 
Eugenics.) Price 21*. net. 

(ii) 



Eugenics Laboratory Publications— {coat.). 

Vol. I of The Treasury of Human Inheritance. Buckram cases for binding can be purchased at 6s. with 
impress of the bust of Sir Francis Galton. An engraved portrait of Sir Francis Galton can be obtained by sending 
a postal order for 3s. 6d. to the Secretary, Galton Laboratory, University College, London, W.C.l. 

Noteworthy Families. By Francis Galton, F.R.S. and Edgar Schuster. {Reissue.) Price 9s. net. 
Life History Album. By Francis Galton, F.R.S. Second Edition. {New Issue.) Price 12 s. net. 

LECTURE SERIES. 


1. The Scope and Importance to the 

State of the Science of National Eugenics. 
By Karl Pearson, F.R.S. Third Edition. 
Price 1*. 6d. net. 

n. The Groundwork of Eugenics. By Karl 
Pearson, F.R.S. Second Edition. Price 
Is. 6 d. net . 

III. The Relative Strength of Nurture and 

Nature. Much enlarged Second Edition. 
Part I. The Relative Strength of Nurture 
and Nature. By Ethel M. Eldkrton. Part II. 
Some Recent Misinterpretations of the Pro¬ 
blem of Nurture and Nature. By Karl 
Pearson, F.R.S. Price 2 s. 6 d. net. 

IV. On the Marriage of First Cousins. By 

Ethel M. Elderton. Price 1*. G<i. net. 

V. The Problem of Practical Eugenics. 

By Karl Pearson, F.R.S. Price 1a 6d. net. 

VI. Nature and Nurture, the Problem of 

the Future. By Karl Pearson, F.R.S. 
Second Edition. Price 1*. Gd. net. 


VTI. The Academic Aspect of the Soienoe 

of National Eugenics. By Karl Pearson. 
Price 3s. net. Sold only with complete sets. 

VIII. Tuberculosis, Heredity and Environ¬ 

ment. By Karl Pearson. Price 1a (id. net. 

IX. Darwinism, Medical Progress and Eu¬ 
genics. The Cavendish Lecture, 1912. By 
Karl Pearson, F.R.S, Price Is. 6d. net. 

X. The Handicapping of the First-born. 

By Karl Pearson, F.R.S. Price 2 s. 6 d. net. 

XI. National Life from the Standpoint of 

Science. (Third Issue.) By Karl Pearson, 
F.R.S. Price Is. 6c?. net. 

XII. The Function of Science in the Modem 

State. By Karl Pearson. Price 2a. net. 

XIII. Sidelights on the Evolution of Man. 

By Karl Pearson, F.R.S. Price 3«. net. 

XIV. The Right of the Unborn Child. By 

Karl Pearson, F.R.S. Price 3a. net. 


Walter Raphael Weldon. 1860 — 1906 . A Memoir. By Karl Pearson, F.R.S. Price 6a. net 
{Gd. postage). 

A few copies of the following are still available : 

The Skull and Portraits of Sir Thomas Browne. By Miriam Tildesley. Price Three Guineas net. 
The Skull and Portraits of King Robert the Bruce. By Karl Pearson. Price One Guinea net. 
The Skull and Portraits of George Buchanan. By Karl Pearson. Price One Guinea net. 

The Skull and Portraits of Henry Stewart, Lord Darnley. By Karl Pearson. Price £1. la. ad. net. 
Application should be made to the Secretary, Biometric Laboratory, University College, London, W.C. 1, 


At the Cambridge University Press, Fetter Lane , E.C. 4. 


The Chances of Death and other Studies in Evolution 


By KARL PEARSON, F.R.S. 


Vol. I 

1. The Chances of Death. 2. The Scientific 
Aspect of Monte Carlo Roulette. 3. Reproduc¬ 
tive Selection, 4. Socialism and Natural Selec¬ 
tion. 5. Politics and Science. 6. Reaction. 
7. Woman and Labour. 8. Variation in Man 
and Woman. 


Reissue, Price 30/- net • 

Vol. II 

9. Woman as Witch. Evidences of Mother- 
Right in the Customs of Mediaeval Witchcraft. 
10. Ashiepattle, or Hans seeks his Luck. II. 
Kindred Group Marriage. Parti. Mother Age 
Civilisation. Part II. General Words for Sex 
and Kinship. Part III. Special Words for Sex 
and Relationship. 12. The German Passion 
Play: A Study in the Evolution of Western 
Christianity. 


Mounted Charts of the Weight and Health 
of Male and Female Babies 


Price 7s . 6d. net the pair, suitable for the walls of Baby-Clinics, or for plotting the 
growth of individual babies to mark their progress. 


The following works prepared in the Biometric Laboratory 
can be obtained from H.M. Stationery Office. 

The English Convict, A Statistical Study. By Charles Goring, M.D. 

Text. Price 9s. Tables of Measurements (printed by Convict-Labour), Price 5s. 

The English Convict. An Abridgment, with an Introduction by Karl Pearson, F.R.S. . Price 3s. 
Tables of the Incomplete T-Function. Edited with an Introduction by Karl Pearson, F.R.S, 
Price £2. 2s. 0 d. or by Post £2. 2s. 9 d. 

(iii) 


AT THE CAMBRIDGE UNIVERSITY PRESS 


THE LIFE, LETTERS, AND LABOURS 
OF FRANCIS GALTON 

By KARL PEARSON, F.R.S. 

GALTON PROFESSOR, UNIVERSITY OF LONDON 

Volume I. Birth 1822 to Marriage 1853. With 5 Pedigree Plates 
& 72 Photographic Plates, Frontispiece & 2 Text-figures 
Price, Bound in Buckram, 36s* net 


“It is not too much to say of this book that 
it will never cease to be memorable. Never 
will man hold in his hands a biography 
more careful, more complete.”— The Times 
"A monumental tribute to one of the most 
suggestive and inspiring men of modem 
times*”— Westminster Qazette 


"It was certainly fitting that the life of the 
great exponent of heredity should be written 
by his great disciple, and it is gratifying 
indeed to find that he has made of it, what 
may without exaggeration be termed a great 
book*”— Daily Telegraph 


Volume II. Letters and Labours of Middle Life* With 50 Plates 
& many Figures in the Text 

Price , Bound in Buckram, 45$. net 


Chapter VIII. Transition Studies: Art 
of Travel, Geography, Climate. 

Chapter IX. Early Anthropological Re¬ 
searches. Transition from Geography 
to Anthropology. 

Chapter X. The Early Study of Heredity: 
Correspondence with Alphonse de 
Candolle and Charles Darwin. 


Chapter XI. Psychological Investiga¬ 
tions. Transition from Physical to 
Psychical Anthropology. 

Chapter XII. Photographic Researches 
and Portraiture. 

Chapter XIII. Early Statistical Investiga¬ 
tions with regard to Anthropology. 
Transition to Statistics as funda¬ 
mental to Biological Enquiry. 


"For the student of the History of Science, as well as for the student of Galton, this 

volume is of prime importance.The volume is important and deeply interesting. 

It is splendidly illustrated*”— Qlasgow Herald 

"Galton’s personality and achievements have taken their place in the history of 
science, and more than justify the sumptuous ‘Life, Letters, and Labours’ on which 
Professor Pearson has lavished special knowledge and labour.”— The Times Literary 
Supplement 

"It is a wholly worthy memorial of a very great man.”— Science 

"We prophesy that Pearson’s Life of Qalton will be ranked by our descendants not 
very far behind Boswell’s * Johnson,’ and Trevelyan’s ‘ Macaulay ’.’’—British Medical Journal 

" If our race continues to progress in the right direction, our descendants of, say, five 
or ten centuries hence will be insatiable in their need of information about such men 
as GALTON and DARWIN. They will bless Pearson for his devotion* If the great¬ 
ness of a man is to be measured by the product of his originality by his energy—and 
this seems the right way of measuring it—GALTON is certainly a very great man and 
his greatness will increase and not decrease as years and centuries go by.”— Isis 


Volume III, shortly. 




NEW ISSUES OF THE GALTON AND BIOMETRIC 

LABORATORIES 

THE TREASURY OF HUMAN INHERITANCE. 

Vol. II. Part III. (Nettleship Memorial Volume.) Blue Sclerotics and 
Fragility of Bone. By Julia Bell, M.A., M.R.C.S., M.R.C.P. Hon. 
Galton Research Fellow. 58 pp. of Text, Chronological Bibliography 
of 150 titles, Figures of 102 pedigrees on 6 Plates and 17 Illustrative 
Plates, two in colours. Price Thirty-six shillings net. 

Tracts for Computers 

XIII. BIBLIOTHECA TABULARUM MATHEMATICARUM 

being a Descriptive Catalogue of Mathematical Tables. Part I. 
Logarithms of Numbers. By James Henderson, Ph.D. Price 9s. net. 

XV. RANDOM SAMPLING NUMBERS. By L. H. C. Tippett, 
M.Sc. With a Foreword by the Editor. Price 3s. 9d. net. 

XVI. LOGARITHMETICA BRITANNICA. A Standard Table 
of Logarithms to 20 Decimal Places. By A. J. Thompson, Ph.D. 
Part IV. Numbers 40,000 to 50,000. Price 15s. net. 
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Part II will contain all the Tables issued in Biometrika during the last ten years 
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PRESS NOTICES OF THE FIRST EDITION 
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matical analysis the highly variable data of biological observation. The immediate cause for congratulation is, 
therefore,not that the tables have been done but that they have been done so well....The volume is in¬ 
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“The whole work is an eloquent testimony to the self-effacing labour of a body of men and women who 
desire to save their fellow scientists from a great deal of irksome arithmetic; and the total time that will be 
saved in the future by the publication of this work is, of course, incalculable....To the statistician these 
tables will be indispensable.’ —Journal of Education 

“The issue of these tables is a natural outcome of Professor Karl Pearson’s work, and apart from their 
value for those for whose use they have been prepared, their assemblage in one volume marks an interesting 
stage in the progress of scientific method, as indicating the number and importance of the calculations which 
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By Julia Bell, M.A., M.R.C.S., M.R.C.P. 143 pp. of Text, Chrono¬ 
logical Bibliography of 425 titles, Figures of 235 pedigrees on 15 
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pigmentosa and allied conditions, congenital stationary night-blindness, ana ghomu retinae. Part II, which 
is now published, is devoted to colour-blindness, and under the capable authorship of Dr Julia Bell 
exhibits to the full the high standard of workmanship that we are accustomed to expect of the Cambridge 
University Press. The volume, indeed, may be considered as an Edition de luxe , which will give equal satisfac¬ 
tion to the bibliophile and the man of science. An excellent reproduction of C. Turner’s engraving of the 
portrait of Dalton by Lonsdale forms an appropriate frontispiece, since Dalton was the first to give a scientific 
description of colour-blindness.”— British Medical Journal 
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Farbenblindheit. Klar und schon ist auch die cinleitende historische Darstellung der Lehre vom Farbensinn. 
Man kann die Verfasserin und den Verlag nur begliickwunschen zu dieser auch in der Form hervorragenden 
Leistung, der vollstandigsten, die unser Fach besitzt und die dies fur die Erblichkeitslehre so wichtige 
Kapitel in einer sehr erwiinschten Weise zusammenfasst.” 
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the Introduction and the historical notes throughout the bibliography will render the Tract of interest to all 
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“Even the professional computer of to-day does not find it by any means easy to keep his knowledge of 
tables up to date. The last two decades have witnessed the complete modernising of our equipment of 
logarithmic, trigonometrical and calculating tables....The computer is frequently at a loss to know where to 
turn for information concerning these new tables, and for guidance as to the best tables to use in his particular 
problems. The work before us is designed to satisfy this much-felt need, and in our opinion, achieves its 
object admirably....Because of the scarcity of comprehensive literature on the subject, and not less because 
of its intrinsic merits, we welcome Mr Henderson’s production.” 

Journal of the British Astronomical Association 

“This is a very complete and well executed index to published tables of the logarithms of numbers. 
Each table is described in such a way as to give the computer all the information he needs to decide whether 
it would be useful in his work. Conversely, if the computer has definite requirements in mind for any special 
task, this volume will at once call to his attention the best tables for his purpose. It should be on the shelves 
of every institution that has a variety of computing to perform.”— The Astronomical Journal 


(ri) 



The Journal of the Galton Laboratory for National Eugenics 

ANNALS OF EUGENICS 

A JOURNAL FOR THE SCIENTIFIC 
STUDY OF RACIAL PROBLEMS 

Edited by Karl Pearson, assisted by Ethel M. Elderton 

I have no Faith in anything short of actual Measurement and the Rule of Three ,— Charles darwin 

ISSUED BY THE GALTON LABORATORY FOR NATIONAL EUGENICS, UNIVERSITY 
OF LONDON, AND PRINTED AT THE UNIVERSITY PRESS, CAMBRIDGE 

Now Ready , Vols . /, II; and Vol, III 

A VOLUME of the Annals of Eugenics will oontain about 400 pages with plates 
and tables. An endeavour will be made to issue a volume annually. 

The subscription price, payable in advance, is 50 s, net per volume (including 
packing and postage). All subscriptions must be prepaid, i.e. paid before issue of 
the first part of a volume, otherwise wrapper prices are charged. Cheques should 
be made payable to the Galton Laboratory for National Eugenics, they should be 
crossed “Annals of Eugenics and sent to The Secretary to the Galton Labo¬ 
ratory, University College, London, W.C. 1 . No foreign cheques can be accepted 
unless they are drawn in sterling, properly stamped , and payable at a London agency. 


Volume III, Parts I & II. April, 1928. 35/- net (including postage) 

CONTENTS 

Frontispiece, “Annals of Eugenics” Portrait Series No. III. Sir Francis Galton in 1909, aged 87. 

I. The Problem of Alien Immigration into Great Britain Illustrated by an Examination of Russian 
and Polish Jewish Children. By Karl Pearson and Margaret Moul. (With twenty-two figures 
in the text.) Part III. Special Eye Examination. Section E. Environment and Ocular Characters, 
Influence of Personal Cleanliness, Cleanliness of Home, Lighting and Ventilation of Rooms, Crowding, 
Rent, Familial Income and other Factors on Ocular Characters. II. Retinitis pigmentosa and 
Leber’s Hereditary Optic Atrophy occurring in Cousins. By Ida C. Mann, M.B., B.S., F.RC.S. 
III. Fresh Evidence on the Inheritance Factor in Tuberculosis. By Percy Stocks, M.D., Reader 
in Medical Statistics, Univ. of Lond., assisted by Mary N. Karn, MA. Part II. IV. On the 
Relative Value of the Factors which Influence Infant Welfare. An Inquiry by Ethel M. Elderton, 
based on Data provided by Dr A. G. Anderson, M.O.H., Rochdale; Dr Wm. Arnold Evans, M.O.H., 
Bradford; Dr Alfred Greenwood, M.O.H., Blackburn; Dr H. O. Pilkington, M.O.H., Preston, 
and Dr C. H. Tattersall, M.O.H., Salford. Part III. Relative Influence of various Factors on 
Infant Health. Influence of Parental Health, Habits of Parents, Cleanliness of Home, Place in 
Family, Age of Mother, Employment of Mother, Breast and Artificial Feeding, Economic Conditions, 
Housing Conditions, etc. on Infant Health. V. Influence of Age and Environment on Parental 
Health and Habits, being an Appendix to the previous paper. By Ethel M. Elderton. VI. Infant 
Mortality in the Metropolitan Boroughs in relation to Occupations. By Percy Stocks, M.D. 

Issued by 

THE GALTON LABORATORY FOR NATIONAL EUGENICS 
University College, London, W.C. 1 

(vii) 




JOURNAL OF ANATOMY 

Conducted by Professor E. Barclay-Smith ; Professor J. P. Hill; Professor J. T. Wilson; 

Professor William Wright; Professor Sir Arthur Keith. 

Volume LXIV, Part II, January 1930. 10«. net. 

CONTENTS 

The Conducting System of the Bird's Heart (with Plates I-IV and 4 Text-figures). By Francis Davies, M.D. 

Le Neopallium des fSquid 6 s: Etude du DGveloppement de ses Plissements (with Plate I and 17 Text-figures). By R. 
Anthony et J. De Grzyiiowski. 

The Urachus: its Anatomy, Histology and Development (with Plates I and II and 4 Text-figures). By R. Campbell 
Begg, M.C., M.A., M.Sc., M.D., Ch.B., F.R.C.S. (Edin.), M.R.C.S. (Eng.), L.ll.C.P. (Lond.), F.A.C.S., F.C.S.A. 
On the Masticatory and Hyoid Muscles of Larvae of Xc nopus Larvis (with 3 Text-figures). By F. H. Edgeworth, M.D. 
Complete Double Aortic Arch (with 2 Text-figures). By Robert D. Lockhart, Ch.M. 

Some Notes on the Early Adrenals (with 2 Text-figures). By Dr M. S. Honan. 

Observations on the Mesenteric Circulation (with 3 Text-figures). By A. J. Cokkinis, M.B., B.S. (Lond.), 
F.R.C.S. (Eng.). 

Abnormalities of the Vertebral Column in a series of Skeletons of Bantu Natives of South Africa (with 13 Text- 
figures). By L. R. Shore, M.A., M.B., M.R.C.P., D.P.H. 

Axillary Mamma in a Man (with 2 Text-figures). By R. J. Gladstone, M.D., F.R.C.S. 

A Note on a Rare Type of Pterion in the Gorilla (with 2 Text-figures). By M. F. Ashlf.y-Montagu. 

Abnormal Duodenum (with 3 Text-figures). By Captain B. S. Nat, M.B., Ch.B., F.R.C.S., l.M.S. and P. D. 
Mookerji, L.M.S. 

The Brain of Fossil Animals—a Review. By Dr AriEns Kappeus. 


CAMBRIDGE UNIVERSITY PRESS, FETTER LANE, LONDON, E.C. 4 


ZEITSCHRIFT FUR MORPHOLOGIE 
UND ANTHROPOLOGIE 

UNTER MITWIRKUNG EINER ANZAHL VON FACHGENOSSEN 
HERAIJSGEGEBEN VON DR EITGEN FISCHER 

ordentl. Professor dor Anthropologic der Universitat, Berlin, Direktor des Kaiser-Williclm-Inwtituts 
far Anthropologic Bcrlin-Dahlem 

Band XXVII. Heft 2 (Preis Mk. 27.—) 

INHALT ERSCHEINUNGSWEISE 

Von der Zeitschrift fiir Morphologie und Antliropologie 
erschcinen jalirlich ca. 2 stattliche Bande, mit einem 
Umfang von je ea. 80 Bogen. Drei Hefte bilden einen 
Bund. Die Arbeiten sind mit Text- und Tafelabbild- 
ungen in bester Ausffihrung versehen. Der Preis der 
Hefte richtet sich jeweils nach Umfang und Ausstattung, 
Der komplette Band stellt sich auf ca. RM. 75. — bis 
RM. 80.—. Ausser Original-Arbeiten aus den Gebieten 
der vergleiclienden Morphologie und Anatomic, Rassen- 
kunde, Erbbiologie und Eugenik, bringt die Zeitschrift, 
—wie nebenstehende Inhaltsangabe zeigt,—Bticherbe- 
sprechungen,die fiber die einschiagigen Neuerschcinungen 
laufend unterrichten, 

Ausfuhrliche Inhaltsverzeichnisse fiber die vorange- 
gangenen Bande der Zeitschrift werden an lnteressenten 
gerne kostenlos abgegeben, ebenso unser Verlagskatalog: 
“ Anatomie-Anthropologic.” 

E. SCHWEIZERBART’SCHE VERLAGSBUCHHANDLUNG 
(Erwin Nagele) G. m. b. H. Stuttgart 
Johanncsstrasse 8* 


Berliner, M. Blutgruppenzugehfirigkeit und Rassen- 
fragen. Mit 1 Tafel. Verschuer, O. v. Zur Fragc der 
Asyinmctrie des rnenschlichen Kfirpers. Mit 5 Textabbild- 
ungen. Kiffner.Fr. Ein Beitrag zur Morphologie der 
Sakei. Mit 5 Tafeln, 5 Tabellen und 8 Textfigurcn. 
Mijsberg, W. A. Die Obliteration der Nahte des Gesichts- 
schadels bei den Javanern. Eckardt, H. Vergleichende 
morphologisehe Studien an den Molaren des Orang-utan 
und des Gibbon. Mit 21 Textabbildungcn und 5 Tafeln. 
Fischer, E. Zur Fragc einer athiopisclien Basse. Mit 5 
Tafeln. Bucherbesprechungen. 


(viii) 




BIOMET RIKA 


A JOURNAL FOR THE STATISTICAL STUDY OF 
BIOLOGICAL PROBLEMS 

FOUNDED BY 

W. F. R. WELDON, FRANCIS UALTON and KARL PEARSON 

EDITED BY 

KARL PEARSON 

ASSISTED BY 

EGON SHARPE PEARSON 

VOLUME XXI 

1920 


ISSUED BY THE BIOMETRIC LABORATORY 
UNIVERSITY COLLEGE, LONDON 
AND PRINTED AT THE 
UNIVERSITY PRESS, CAMBRIDGE 



With this volume—the twenty-first —JUornetrika reaches its majority, but 
it is the twenty-ninth year of its life. A history of its inception and early 
struggles by the Editor of this Journal will be found in the forthcoming 
third volume of The Life , Letter's, and Labours of Francis G niton . 


PRINTED IN GREAT BRITAIN 



CONTENTS OF VOLUME XXI 

Memoirs 

I. Tables for ascertaining the Significance or Non-significance of 

Association measured by the Correlation Ratio. By T. L Woo . 
Introduction to Tables, pp. 1— 8 ; Tables, pp. 9—66. 

II. A Contribution to Basque Craniometry. By G. M. Morant, D.Sc. 

With five plates, three figures in the text, a folding table and 
three cranial contours in pocket at the end of this volume . 

III. On Measurement of the Internal Diameters of the Skull in relation 

(I) to the Prediction of its Capacity, 

(II) to the “ Pre-eminence ” of the Left Hemisphere. 

By M. F. Hoaomcy and Kari, Pearson. With two diagrams 
(one folding) and four folding sheets of tables .... 

IV. On the Distribution of the Ratio of Mean to Standard Deviation in 

small samples from Non-normal Universes. By Paul R. Rider, 
Ph.D. With five figures in the text. 

V. Albinism in Dogs. By Karl Pearson and C. H. Usher. With 

two pedigree plates and eight plates (two coloured) 

VI. On the Distribution of the First Product Moment-Coefficient, in 

Samples drawn from an indefinitely large Normal Population. By 
Karl Pearson, G. B. Jeffery, F.R.S., and Ethel M. Elderton. 

With thirteen figures in the text. 

Tables of the Product Moment-Bessel Function . 

VII. Laplace, being Extracts from Lectures delivered by Karl Pearson 

VIII. Les Origines de Laplace. Sa G 6 n<klogie, ses Etudes. Par L’Abb^ 

G. A. Simon. With a pedigree plate. 

IX. Studies in the Theory of Sampling. By Joseph Pepper, B.A., B.Sc. 

With three figures in the text. 

X. The Distribution of Frequency Constants in Small Samples from 

Non-normal Symmetrical and Skew Populations (2nd Paper: 
Distribution of “ Student’s ” z). By Eoon S. Pearson, D.Sc., 
assisted by N. K. Adyanthaya, B.Sc., and others. With six 
figures in the text. 


PAGE 

1 

67 

85 

124 

144 

164 

194 

202 

217 

231 

259 



iv 


Contents 


PAGE 

XI. Sampling when the Parent Population is of Pearson’s Type III. 

By Cecil Calvert Craig, Ph.D.287 

XII. Note on Dr Craig’s Paper. By Egon S. Pearson, D.Sc. . . 294 

XI [I. On Racial Differences in Stature Long Bone Regression Formulae, 
with Special Reference to Stature Reconstruction Formulae for 
the Chinese. By Paul Huston Stevenson, M.D. . . . 303 

Note by K. P. . . . . . . . . . 319 

XIV. Measurements of Macedonian Men. ByMARGARETM. Hasluck,B.A. 

and G. M. Mgrant, D.Sc. With four plates, one figure in the 
text and folding map ......... 322 

XV. Some Notes on Sampling Tests with two Variables. By Egon 

S. Pearson, D.Sc. With three figures in the text . . . 337 

XVI. Inequalities for Moments of Frequency Functions and for various 

Statistical Constants. By J. Shohat (Jacques Ciiokhate) . 361 

XVII. Note on Professor Shohat’s Paper . . . . . . .370 

XVIII. On the Standard Error of the Mean Square Contingency. By 

Professor T Kongo.376 

Miscellanea 

(a) Note on Variability in Girls and Boys (Glasgow) for Height and 

Weight. By Ethel M. Elderton.429 

(ft) Note on a Paper published in Biometril'a, Vol xix. By J. O. Irwin 431 

List of Plates, etc . 

Biometrika Portrait Scries, No. VI. Pierre Simon Laplace . Frontispiece 
(a) (J. M. Movant : A Contribution to Basque Craniometry. 

Table of Absolute and Indicial Measurements of 39 Male 

Spanish Basque Skulls from Zaraus in the Mus6e Broca . to face page 82 

Plate I. Skull of Adult Male Basque. Norma facialis . „ „ „ 84 


Plllto II. „ , 

> ,, 

,, 

,, 

Norma lateralis 







(Left Profile) 

99 99 


Plate III. , 

3 99 



Norma verticalis 

ft 91 

>» 

Plate IV. „ , 

3 M 


„ 

Norma basalis 

99 99 


Plate V. ,. , 


M 

„ 

Norma occipitalis . 

*9 91 

>» 


Three Tissues, Median Sagittal, Transverse and Horizontal 
Contours of Basque Type Skull in pocket at end of volume. 




V 


List of Plates, etc . 

(f3)Hoadley and Pearson: On Measurement of the Internal 
Diameters of the Skull. 

Fig* 1 . Internal Lengths of Right and Left Hemispheres to 

the unit 0*6 mm. of Measurement. to face page 110 

Four folding sheets of Tables of Cranial Internal and External 

Diameters and Capacities, Tables V—XX 122 

( 7 ) Pearson and Usher: Albinism in Dogs. 

Pedigree I. Showing the Interrelationship of three Types of 

Albino Dogs (Pearson’s Dogs). to face page 146 

Pedigree II. Showing the Interrelationship of three Types of 
Albino Dogs (Usher’s Dogs).„ „ 157 

Plate I. Dondos, or pure-bred Pekinese Albinos „ „ 162 

Plate II. Cornaz Albinos from pure-bred Pekinese Albinos . „ „ „ 

Plate III. Coloured Sketch of Skewbald Cornaz Albino Fe , 
from pure bred Pekinese Albino (Dondo) Jill x pure-bred 
Pekinese Albino (Dondo) Jack .„ „ „ 

Plate IV. Albino Litters from extracted Albino x pure-bred 

Pekinese Albino.. „ 

Plate V. Cornaz Albinos, product of the mating of a pure¬ 
bred Pekinese Cornaz ( Wang) with a black bitch (from 
white pure-bred Pekinese Albino x black Pompek, i.e, a 
product of pure-bred Pekinese Albino x pure-bred black 
Pomeranian).„ „ „ 

Plate VI. Left: Skin of piebald puppy from Dondox Pompek, 
eyes and hair with many pigment granules. Right: Skin 
of Skewbald Cornaz Albino puppy with albinotic eyes, pale 
buff (“ grey ” or “ lilac ”) patches with diffused pigment and 
a few granules (?).„ 

Plate VII. Juby , Skewbald Cornaz Albino, and Wu, pure white 
extracted Albino. Parents of a coloured litter 

Plate VIII. Coloured Diagram to indicate the colour results 
of mating a pure white Albino with a Skewbald Cornaz 
Albino.„ 1 , 

( 8 ) L'Abbi 6 r. A . Simon: Les Origines de Laplace. Sa Gen 6 - 

alogie, ses Etudes. 

Pedigree Sheet: La G^n 6 alogie de Laplace . . . . to face page 217 




vi 


List of Plates, etc. 


(e) Hasluck and Murant: Measurements of Macedonian Men. 

Folding Map of South-West Macedonia in 1923 . . to faoe page 328 

Plate I. Above: Christian Greek (Kozani). Below: Greek¬ 
speaking Mohammedan (Chotil).. u 336 

Plate II. Above: Mohammedan Turk (Hadovo), probably 
pure Asiatic type. Below: Mohammedan Turk (Sofular), 
ancestry probably intermarried with Christian . . . „ M 336 

Plate III. Above: Mohammedan Buigar (Kosturyan in 
Karajova). Below: Christian Buigar of stock type (Pateli) „ „ 336 

Plate IV. Above: Christian Vlach of usual dark type 
(Mejidieh). Below: Christian Vlach of fair type (Samarina) „ „ 336 




1749—1827. 

Hus delicate calculus applies to the most important questions in life, which arc 
for the most part only problems in probability.” Tlieorie analytique des 
Probubilitex. 


tiiomctnbt" Portrait Series, No. VI. Issued with Vol. XXI. 


















































volW xia' 1 


' DECEMBER, 1920 


BIOMETRIKA 


TABLES FOR ASCERTAINING THE SIGNIFICANCE OR 
NON-SIGNIFICANCE OF ASSOCIATION MEASURED 
BY THE CORRELATION RATIO. 


(Introduction to Tables for if computed by Dr T. L. Woo.) 


( 1 ) Introductory. On the Distribution of f for the case of Independent Variates . 

In a recent paper* Harold Hotelling has obtained the frequency curve for the 
distribution of if, the square of the correlation ratio, subject to the conditions: 

(a) Independence of the variates in the sampled population. 

(ft) An “ indefinitely large * sampled population. 

(c) Normal distribution of the two variates. 


This frequency curve is: 

r 


z = 


n-8 N-n- 


•(i). 


T(*(n-l))r (*(#-»))’ 
where N is the size of the sample, and n the number of arrays on which rf 
is based. Conditions (a)—(c) of course very much limit the field of application 
of the above formula, the more so as if is generally used to investigate dependence, 
when the distribution is not certainly normal. Still there appears to be a number 
of cases, in which the primary variable is given in broad categories, but the 
secondary variable in quantitative measure, where the above result may be of 
considerable service, and we know so little about the distribution of rf, that any 
contribution to our knowledge is of value and likely to be suggestive. We have 
accordingly written down some of the values which flow from the above result 
and are likely to be useful to the practical statistician. 


Modal value of f, or rf, 
Mean value of 17 * or 


_ n — 3 
“ N - 6 
n — 1 


t. 


_ * 2 n— 1 n — l\ 

n-i) 


2 ^( 1 -^) 

ir+T“ 


* Proceedings of the National Academy of Sciences , Vol. n. pp. 657—662, Washington, 1926. The 
game form was reached by R. A. Fisher three years earlier (Journal of the Royal Statistical Society , 
Vol. lxxxv. p. 606, 1922). Fisher's proof may be more general than Hotelling's, although we must 
oonfess to it harder to foikrw. Hotelling assumes his variates have a normal distribution; 

Fisher that the arrays for which 9 * is computed are normally distributed. Both deal only with the 
case in which the variates are in the sampled population uncorrelated. 

t The value in customary use* is , which is, of course, practically identical with the above 

for reasonably large samples. 
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2 Tables for ascertaining the Significance of the Correlation Ratio 


hence 
Again: 

Further: 


_ u « - 1 fi n - 1 ^ n n-l\ 1 

,M3 iv-l/v 1 *N- i) (N + 1 )(N + 3) 


_8^(1 -i?)(l-2ip) 
(iV+1)(A+3) ' 


») — 1 
jV 


(JV + l)(A r + 3)(i\T + 4) 


1 .( , -^i)< jr+3,+4 ( 1 - 2 ^)’ 

- m iHj* shLh -»>(*+»>+*<> - vn 

Thus all the moment coefficients can be expressed in terms of 5j a . From these 
results flow: 

N+l 1 (1 -»ff 

Pl "N + SN + fir^ii-^y 

a .-»* +1 (:i 4 - 4(1 ~ 2 ^ 2 ^ 

Pi! N + 5 \ i (N + 3)^(1-ip)/’ 


or, 


* _Q^.+i^ 3Ar + s « 
^ 2_3 i^4 : 5 + 2"iV + 5 /91 - 


The latter equation shows that for iV very large 

2ft = 6 4- 3ft, 

or, the line for ft, /3 2 always lies above, but approaches, as A increases, the line for 
curves of Type III. Clearly, whatever be the value of v, the fts lie on a straight 
line nearly passing through the Normal Point and of slope less than 1*5. The 
order of & is best seen from the equation : 

a _ h n + 1 S-l (1-2ip) a 1 
1 N+ 3'N + ii 1-ip n —1 ’ 

or, as N grows large while n remains finite, this approaches the limit, ft = — 8 -j . 

For the usual range of values of «, ft will be of the order 2-0 to 0-5, i.e. fairly 
considerable. Accordingly ft will differ somewhat widely from 3. Thus, for all 
samples, the curve of distribution will be a leptokurtic curve differing very 
sensibly from symmetry, owing to the (in practice) limited number of arrays. 

(2) The above values for if must be clearly distinguished from those for »/. 
The frequency curve for i) is: 
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Modal value of 77 , or rj, 


Mean value of 17 , or rj, 
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_n-l n +1 
N-1N +1 + 


2n] (1 ' 

(v?-S(vY. 


From these equations ff t and /? 2 can be found and hence the distribution of r) 
appreciated in the usual way. The process, however, appears to have no advantage 
over the discussion of the distribution of rj 2 . It is of interest, however, to note 
what error results from supposing rj = vV and r? = (rj) 2 ' It is worth noting that 
<r, = V^-tu) 2 


Table of rj and rj* as compared with flf and (rj) 2 - 



Number of Arrays 

Size 

of 

Population 

n 

= 5 


71 = 

10 

71 = 

20 

50 

ij =-2699 
± '0630 

(7 ?■=■ -0729 

n « 

•4190 
± *0901 

(7)2 ='1750 

ij — *0177 
± *0787 

(7)* = -3816 


\^2=-2857 

ij 2 =*0816 


•428G 

7 2 = '1837 

^2=-6227 

7 2 =-3878 

100 

7 = *1894 
± *0078 

(7)2=-0359 

rj = 

•2940 
± ’0608 

(7) 2 -'0864 

7 «*4,335 
±•0635 

($)*«-1879 


vV=-2010 

7 2 «-0404 

\^rj “ *= 

•3015+ 

7 2 = -0909 

7/72=-4381 

7 2 --1919 

500 

ij = -0842 
± -0304 

(7)*= -0071 

v = 

•1307 
± *0309 

(7) 2 =-om 

7 =-1927 
+ •0308 

(7)2=-0371 


vV 2 = -0895+ 

ij' 1 = '0080 

'ft- 

•1343 

7 2 = '0180 

N/7 a =-1951 

7 a - -0381 

1000 

ij = *0595 ~ 

± *0210 

((j)*«'0035 + 


•0923 
± -0219 

(7)*-'0085+ 

7 —*1361 
± *0221 

(7)2= -0185+ 


0633 

7 a -'0040 


•0949 

7 2 =’0090 

V7 2 «-1379 

7 a m *0190 


Thus for N= 50, the difference ranges from "005 to '016 for rj and V^ 2 , and 
from ‘006 to 009 for (rj)* and rj 1 . As there is less range in the variation of 
(rj ) 2 and i f, and as it is rj* which arises firet in our calculations, the tables which 
accompany this introduction are adapted to rj*. 
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(3) The conception in the tables is very simple. Starting from Equation (i) wo 
measure the ratio of the area of the curve beyond rf**if + Xtrf to the area of the 
whole curve; this is the “probability integral” of the frequency distribution of if. 
Actually X is given two values which will give P approximately the values 
•01 and 02. To make P exactly *01 and *02 would have involved five or six 
times the amount of calculation. Actually the values selected for X bring P 
sufficiently near # 01 and *02 to allow us to determine whether if may be con¬ 
sidered to differ significantly from if. If P lies above *02, i.e. 1 in 50, then we 
cannot emphasise the difference of ij* and if as probably having significance; if P 
is below *01, then we usually claim significance for if. Between these values 
differences may be of a doubtful character, and really indicate that our sample 
is hardly large enough to admit of a definite judgment of significance. 

It will be noted that 

X — Observed f — Mean if 
~~ Standard Deviation of if * 


a function familiar enough in the case of the probability integral of the normal 
curve. In that case X = 2*33 for P = '01, and = 2*05 for P = '02. A comparison of 
the values for X in the accompanying tables indicates how far the distribution of if 
on the side towards unity differs from a normal distribution; the divergence 
when n and N are not small is not very great. 

We have no need to consider deviations of if from if when X is negative. 
For in such cases if is less than the value if which is the mean value of if for 
no correlation, and we cannot therefore predict any significance for it on the 
basis of our size of sample. 

The tables give in the first column if , and in the second «, both on the 
assumption of zero association and of sampling from a normal population. The third 
and fourth columns give two values of X corresponding to two values of P approaching 
respectively *01 and *02. It was not possible to deduce the whole table from the 
Tables of the Incomplete Beta Function, because the latter are confined to the 
range of B ( p , q ), in which p and q arc both 50 or less. The Incomplete Beta 
Function Tables were accordingly only used for checking some of the lower values. 
Weddle’s quadrature formula was also used for checking. The values of P for 
given values of X were obtained for odd values of n by means of the formula: 




s-l N-2s-l x j 

1! A-2s + V 21 #~2« + 3 

(s-l)(s-2)( 5 -3)iYr-2s--l 1 

31 N — 2s4^5 1 ~ + - 


a -*y 


s-lN-Zs-l (a—1)(»—2) N—28—1 (s-1)(s-2)(*-3) N-2s-1 

1! N — 2s+ 1 + 2! 2V-2#+3 3! N-2s+5 + 


1 



T. L. Woo 


6 


where n ® 2s «f 1 . For n odd both series are finite and as n was not taken greater 
than 21 ,8 did not exceed ten, nor the number of terms in numerator $nd denomi¬ 
nator exceed ten each. The coefficients depend only on N and thus the powers 
(1 — z)* could be relatively easily modified until a suitable value of P was found. 
No attempt was made to reach exact values for P. The values given to N were 
fairly close together until N = 100 , when an argument change of 50 was found 
adequate for graphical interpolation. This interpolation was carried out by 
Miss Ida McLeara. 

An appropriate system of Vs and Ps having thus been determined for all 
values of N from 50 to 1000 , and for all odd values of n from 3 to 21 , it was 
needful to determine the corresponding values of \ for the even values of n. 
This task of rather troublesome graphical interpolation was kindly undertaken by 
Dr E. S. Pearson. The values of P and \ are only tabled to three and two 
decimals respectively. This is as much as the processes adopted justify, but such 
approximate values of P and \ are adequate for the end in view, i.e. to deter¬ 
mine whether if differs significantly from if, the mean value of 17 2 when there is 
no association. 

In the following tables if denotes the mean value of if when there is no corre¬ 
lation, i.e. it is not ( 17 )*, but printed briefly for (if ), and in order to indicate that 
<r,« is not the general standard deviation of 17 s , whether or no there be correlation, 
but only when there is no association between the variables, we have printed 
for no association in the population sampled as oy, thus linking it up with if, 
which is now in pretty wide use—not for the mean of any if— but for the mean 
when we suppose no association. 

(4) Illustrations . Throughout wo shall use the observed value of if uncor¬ 
rected for number of arrays. 

(i) Influence of Crowding on General Astigmatism . We require the corre¬ 
lation ratio 7 ) of general astigmatism on crowding, i.e. number of persons per 
room, Va.p* Our observations are on 716 schoolboys*, and we have for eight 
arrays: 

V*a.p = ' 022,821, *009,790, oy- *005,200, 

V* = (*Pa.p - V % A.p)l°i? 8=1 2-51. 

This value of \a for the data is a trifle lower than the value of \ a =2*54, and 
accordingly by rough extrapolation it would be about once in 45 or 46 trials that 
such a value of v 2 a.p would occur, if there were no association between general 
astigmatism and crowding. It is possible that there may be some association but 
no great stress could be laid on the result. 

(ii) Influence of Familial Income on Corneal Astigmatism. We wish to con¬ 
sider the influence of poverty or its reverse on corneal astigmatism. Our data f 

* AnmU of Eugenia, Vol. m. p. 89, et teq . 
t Ibid . p, 46. 
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consist of 228 boys arranged in nine arrays, if we include the “ comfortable group ” 
as one array. We have 

V*ca.i -*139.397, ?«./--085,242. a,* =-017,232, 

\d « {v 2 ca.i-V*ca. j)/<V = 6*04. 

This value for the data is much above \ x and the odds against r) 2 being a result of 
random sampling are very much greater than 99 to 1. 

(iii) Hours of Homework and Age . Here we have data for 322 boys*, and 
find for eight arrays: 

ifu.A — ‘027,433, v 2 h.a = 021,807, <7^== 011,493, ^ = 0*49. 

\ d is accordingly much below X 2 ( == 2*54), and it does not appear that older boys 
work longer hours. 

(iv) Distance of Nearpoint and Colour of Iris . The data are for 770 boys in 
seven arrays of eye-colour determined by Martin’s scale f. We have 

V 2 np.ec= 021,727, v^nr.ec * *007,802, a* = *004,481, 

and accordingly 

x _-021,727--007,802 
Xd ~ m *004,481 ~' 5 J ’ 

and therefore by rough extrapolation 1 J = about *009, or the odds are about 
111 to 1 against rj 2 for these data having arisen from a population in which 
distance of nearpoint and eye-colour are unassoeiated. 

(v) Mental Capacity and Place in Class. 249 boys arranged in four classes, 
Excellent, Good, Moderate, Dull. The data gavef: 

V 2 i\ i = *525,045, = *012,097. af = *009,778, 

X d = (*525,045 - *012,097)/*009,778 = 5*25. 

This value of A ri is so far in excess of \ x that we have no hesitation in asserting 
significance. The reader will see that while we can on the basis of our tables 
predict significance, it would need a far larger series of values for P and X, for each 
value of n and N to obtain even a rough measure of the actual degree of signi¬ 
ficance. The labour of calculating such tables would be excessive, and when 
calculated the cost of printing would render publication impossible. It is for 
these reasons that we have limited our values of P to two in the neighbourhood 
of those where most practical statisticians would consider significance to be 
assured. 

In this particular case, although our data are sparse and the arrays few, we 
may certainly assert that the uncorrected correlation ratio of *7246 is very 
definitely significant. 


* Annals of Eugenic #, Vol. hi. p. 75. 
f Diometrika , Vol. vui. p. -644. 
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(vi) Intelligence Test Marks and Teacher's Estimate of Intelligence*. The 
correlation ratio for a table containing 63 girls, divided into fottr arrays, gave: 

M’.e.i 338 *335,825, V^t.m:E' i 48 *048,387, *» *037*933* 

Hence \a = 7*58, which is far above A* * 3*20. The chance that the association is 
merely due to random sampling from unassociated material is extremely slight. 

Dr Isserlis gives another table (Table I) for the correlation between School 
Examination Marks and Intelligence Test Marks. This is based on only 50 girls, 
and falls outside the present tables, yet the constancy of \ and P for a given 
number of arrays is so great that it is easy to test the significance. We have for 
five arrays: 

V 2 t.m.s.m *164,761, ~ *081,633, cr^ = *054,221. 

Hence \i = 153. This is considerably below \ 2 = 2*68, or the odds against 
such a value of rj arising from random sampling are very Small. In other words, 
the data are too sparse for us to predict whether there is any relation between 
School Marks and Intelligence Test Marks. 

(5) Fisher has given f for the case of no correlation and normal distribution of 
the variates the following frequency curve for the distribution of the square of the 
multiple correlation coefficient (Z 2 2 ). 

Probability that R 2 lies between R 2 and R 2 + dR 2 

- _ JT (i 1 )i_. — ( 7?2\£(n-2) /1 _ Jp)W-n-VdR* (iii) 

r ($ (tf - ft - 1 )) r (J n) Klt) U n) a K h 

Accordingly, if we write in the Equation (i) to the frequency curve for rf above, 
n + 1 for ??, we obtain a curve identical with that for R 2 , or the tables for the 
determination of the significance of rf can be used for the significance of R 2 pro¬ 
vided we enter these tables with n taken equal to the number of the variates 
w l9 ... x n on which R is based plus unity, i.e. if x 0 be the variate, which we are 
multiply correlating, we must enter the tables with the total number, n + 1 , of 
variates concerned, x 0 , x t , x 2 ... x n . We thus obtain the mean value of 72 s by the 
corresponding if, and the standard deviation, <r^, of R 2 by the corresponding 
value of <Tjja, 

Ilhistration (vii). Rainfall in relation to Longitude , Latitude and Altitude . 
The data referred to 57 recording stations in Hertfordshire, and FisherJ found 
R 2 = *4431. Here N = 57, n = 4 (longitude, latitude, altitude, and rainfall) and 
thus our tables give: 

R 2 as *0536, sss *0418. 

It follows that: \d~ (*4431 — *0536)/*0418 = 9*32. This is nearly three times as 

* 11 The delation between Home Conditions and the Intelligence of School Children.” By L. Isserlis, 
D.So., Medical Research Council, Special Report, Series 74. 
t Phil. Tram . Vol. 218, B. p. 91, 1924. 

$ Statistical methods for Research Workers , pp. 185 and 228. 
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great as \ x for which the chance is about 1 in 100. Hence the multiple correlation 
found is certainly significant. 

Readers of this paper and users of the present tables are once again warned of 
their limitations. The theory on which they are based depends upon our sampling 
being made from an indefinitely large normal population; the argument is 
based, in the case of both rj* and iJ 2 , on the improbability of the observed result, 
supposing the variates are uncorrelated . The tables tell us the probability of 
association, but if we conclude that the variates are correlated, they really throw 
no light on the closeness with which our sample value of the association probably 
approaches the actual association in the sampled population. The distribution of 
rj 2 t even for a normal surface, when correlation does exist would be of undoubted 
service; but we cannot overlook the point that the chief value of the correlation 
ratio arises in cases where we have at least grave doubts as to the nature of the 
regression being linear. 

The illustrations provided should suffice to indicate the method of using the 
tables and their value in saving labour. 



TABLES FOR ASCERTAINING THE SIGNIFICANCE OR 
NON-SIGNIFICANCE OF ASSOCIATION MEASURED BY 
THE CORRELATION RATIO 


fitooMtribxzi 



= number of arrays 
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$4 Tables for ascertaining the Significance of the Correlation Math 
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A CONTRIBUTION TO BASQUE CRANIOMETRY. 

By G. M. MORANT, D.Sc. 


For a number of reasons the ethnology of the Basques of the south-west of 
France and the north of Spain is of particular interest. Cultural peculiarities, and 
the fact that the Basque language is structurally different from all others of Western 
Europe, suggest that the people have been isolated for a long period, but their 
origins are not disclosed by any historical records. The earlier attempts to discover 
the relationships of the race were based almost entirely on philological evidence 
and the most fantastic and often diametrically opposed theories have been brought 
forward by different writers*. At one time or another the Basques have been 
supposed akin to the ancient Egyptians, Guanches, Berbers, Etruscans, Phoenicians, 
Lapps, Finns, Bulgarians, or to Asiatic races; others have seen in them the unique 
descendants of a prehistoric population of Europe, or the sole survivors of Atlantis! 
The first anthropological contribution to the subject which is of any importance 
was made by Paul Broca in 1862 f. He describes a series of 60 skulls from Zaraus 
which had been presented to the Paris Anthropological Society by Gonzales Velasco. 
“Ces cr&nes,” Broca writes, “ont. 6t£ extraits sans aucun choix, et dans l’ordre ou le 
hasard les pr6sentait, dun cimetifere de la province de Guipuzeoa (Espagne), dans 
une petite locality oil les Basques, depuis les temps historiques, n’ont subi aucun 
melange de race.” The mean cephalic index was found to be 77*7. The only other 
measurement considered in this paper is the cranial capacity given as 1486 9 J. The 
inionic protuberance and the imprint of the muscles of the neck were observed to 
be particularly feeble. In the following year Broca published another paper on the 
Zaraus skulls controverting the opinion expressed by Pruner-Bey that they were 
racially heterogeneous§. A number of additional measurements of the unsexed 
series are given and it is suggested that the Basques are most closely related to the 
peoples of the north of Africa. It is Broca again who makes the next important 
contribution to the craniology of this race||. His paper deals principally with 
a series of 58 Basque skulls from the French town of Saint Jean-de-Luz, which is 

* An interesting account of a number of these theories is given by W. Z. Ripley in Chapter vnr of 
The Race* of Europe . 

f “ Sur les caraot&res du cr&ne des Basques.’’ Bulletin* de la SociStS d* Anthropologic de Part*, t. hi. 
1862, pp. 679-697. 

£ This value is probably too high. A criticism of Brooa’s method of determining eranial capacities 
is given in Alice Lee and Karl Pearson: “Data for the Problem of Evolution in Man. VI. A First Study 
of the Correlation of the Human Skull/’ Phil. Tran*. Royal Society , London . Series A. Vol. 196,1901, 
pp. 226—264. 

§ “Sur les cr&nes basques.” Bulletin* de la Socidtf d y Anthropologic de Paris, t. iv. 1868, pp. 88—72. 

|| “Sur les cr&nes basques de Saint Jean-de-Luz.” Ibid. 2° s6rie, t. m. 1868, pp, 48—101. 
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A Contribution to Basque Craniometry 

less than 30 miles from Zaraus, though separated from it by the Pyrenees. The 
cemetery in this case had been disused by A.D. 1532. The mean cephalic index of 
the 57 French specimens is given as 80*25 which is significantly greater than the 
value for the Spanish collection. Numerous other measurements given for the two 
unsexed series suggest that other characters also differ significantly, though, m 
general, the means agree closely. The collection from Zaraus had been augmented 
in the meantime by the addition of 19 skulls*. The mean cephalic index of these 
is given as 76 0, which is not significantly less than the value of 77*7 found for the 
original 60. 

In 1925 the present writer was able to study the Spanish Basque crania preserved 
in the Mus6e Broca f and he re-measured and drew contours of the 39 specimens which 
had been supposed male by Broca. Individual measurements and means are given 
in Appendix II below. This partial duplication of a somewhat laborious task was 
desirable for a number of reasons; individual measurements of the skulls have not 
previously been published, the unsexed means given by Broca do not serve modern 
requirements and there is no indication in his published papers of the number of 
specimens on which each is based; also, a number of additional measurements and 
type contours are now given J. 

Since Brocas day a considerable number of living Basques have been measured, 
chiefly by Aranzadi in Spain and Collignon in France. Their studies have shown 
that there is a real difference between the cephalic indices of the populations on 
the two sides of the Pyrenees, as had been suggested by the cranial series, but 
nearly all other characters are closely similar and there is every reason to believe 
that the Spanish and French Basques are varieties of the same race§. There appears 
to be a fairly general agreement in relating it most closely to ancient Egyptian and 

* Paul Broca: “Cr&nes basques de Zaraus.” Bulletins de la Socittt d'Anthropologic de Paris , 2® s6rie, 
t. i. 1866, pp. 470—473. 

t Ho was indebted to Professor L6once Manouvrier, the late Secretaire g6n6r&l of the Sooi4t6 d’An- 
thropologie de Paris, for permission to undertake this study. 

£ It must be admitted, too, that there is sufficient reason to question the accuracy of some of the means 
given in Broca’s published works on this subject. The mean basic-bregmatic height of 128*75 given for 
the 60 unsexed Zaraus skulls on p. 58 of the 1868 memoir and on p. 64 of the 1868 memoir is an almost 
impossibly low value. It gives a height-length index of 67*4, which appears to be smaller than any 
other mean recorded for a cranial series. For the 89 male specimens from Zaraus I find a mean height 
of 130*8 and the height-length index is 70*5: these values were checked by measurements of the sagittal 
type contour. Again, on p. 80 of the 1868 memoir the mean nasal height is given as 42*85, which 
eads to a nasal index of 52*6. This is higher than any other nasal index recorded for a European race, 
whereas it is clear that, in reality, the Basque nasal index is extremely low. Corrections of these 
obvious inaccuracies would give a better correspondence between Broca’s unsexed means of the Zaraus 
and St Jean-de-Luz series. 

§ In an article in L’Anthropologie (t. v. 1894, pp. 276—287), in which Collignon summarises the 
results of his important memoir published in the Mtmoires de la Sociiti d'Anthropologie de Paris 
(3 e s4rie, t. i. 1895), this writer remarks (p. 286): “...Broca, ne jugeant la population basque fran^aise 
que d'apr&s des or&nes provenant de la plus deplorable locality qu’il ffit possible de ohoisir 4 ce point de 
vue (St Jean-de-Luz) d’une ville cosmopolite par excellence depuis plusieurs si&cles, appliquait & tort 
l’impression, exaote d’ailleurs, qu’il ressentait au reste du pays...,” The variabilities of the male 
St Jean-de-Luz series could be calculated from Broca’s manusoript records, and it would be interesting 
to compare them with the values given for Spanish Basque crania in Table III below. 
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modern North African races. Apart from the measurements of very small numbers 
of skulls, no further advance in Basque craniology appears to have been made until 
1892*. A few measurements of 489 Spanish crania in the Museum of Madrid were 
published then: they comprise the sexed distributions and means of the cephalic 
and nasal indices for different provinces, and the means only (in whole millimetres) 
of the calvarial length and breadth. There are 46 male skulls from Guipuzcoa and 
6 from Navarre and the means agree excellently with those of the series from 
Zaraus which is preserved at Paris f. A considerable number of sexed mean measure¬ 
ments of the skulls from Guipuzcoa at Madrid were published in 1913 J and in the 
following year the individual measurements of the specimens in the same collection 
were given for the first time§. There are 14 male and 15 female skulls from Zaraus 
and 23 male and 18 female skulls from neighbouring localities. Sexed means of this 
Guipuzcoan series are given in Table I below. A study of the facial triangle of 
Basque skulls was published by Aranzadi in 19171|. Individual measurements of 
the sides and angles of the triangle are given for 53 male and 40 female specimens 
from the provinces of Guipuzcoa, Visoaya, Navarre and Alava. There is a good 
agreement with the corresponding measurements of the Basque series at Parish. 
An inter-racial comparison made by Aranzadi in this paper shows that the Basque 
skull is peculiarly orthognathous: the chord from basion to alveolar point, the nasal 
angle and the gnathic index shown are almost, if not quite, extreme values for all 
races of man. Finally, in 1922, Aranzadi published a synthesis of his measurements 
of Basque skulls preserved in Spanish museums**. He deals with 82 male and 83 
female specimens, but unfortunately the individual measurements are not provided 
and many of the means are given to the nearest millimetre only and without any 
indication of the number of individuals on which each is based.* 

* L. de Hoyos Sainz and T. de Aranzadi: Un avanoe 4 la antropologfa de Espana.” Analen de la 
Sociedad (Espahola) de Historia Natural , t, xxi, 1892, pp. 1—71. 

f The male means are: 



Cephalic Index 

Nasal Index 


(100 BjL) 

(100 NB/Nir) 

Basques from Guipuzcoa and Navarre (Hoyos S&inz and 

76-6 (46) 

44-6 (45) 

Aranzadi) . 

Basques from Zaraus (Morant) . . 

77-2 (89) 

44-8 (85) 


X T. de Aranzadi: “ Cr&neos de Guiptizooa.” Asociacirfn Espahola para el Progreso de las Ciencias . 
Congreto de Madrid , 1913. 

§ T. de Aranzadi: “Sar quelques correlations du trou occipital des cr&nes basques.” Bulletins et 
Memo ires de la Soci4t£ d'Anthropologie de Paris , 6* s6rie, t. v. 1914, pp. 325—382. 

|| “ El tri&ngulo facial de los cr&neos vascos.” Memoriae de la Beal Sociedad Espahola de Historia 
Natural , t. x. 1917, No. 8*. 

II The male means are: 



G'U 

GL 

LB 

Njl 

A L 

Bl 

Spanish Basques (Aranzadi)... 
Basques from Zaraus (Morant) 

70-8 (53) 
70*7 (81) 

92*9 (53) 
90*7(87) 


02°-9 (53) 
61°'9 (81) 

74°-8(58) 
75 o- 0 (81) 

42°-8 (58) 
48 0, 1 (81) 


** “Sfntesis m&rioa de craneos vascos.” Revue Internationale des dtudes basques , t. xm. 1922, 
pp. 1—82. 
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TABLE I. 

Mean Measurements of Series of Basque Crania from Ouipuzcoa. 




Meawured by 


Character* 

Aranzadi 

Morant 

Aranzadi and 
Morant 


Female 

Male 

Male 

Male 

L 

179-0 (33) 

180*2 (37) 

185-8 (39) 

186-0 (76) 

B 

138-9 (33) 

142*8 (87) 

143*5 (39) 

143*2 (76) 

B" 

110*9 (33; 

120*2 (37) 

119*7 (39) 

119-9 (76) 

B' 

95-0 (33) 

96*6 (37) 

97*1 (39) 

96-9 (76) 

Bi-asterionic B 

108-9 (33) 

112*8 (37) 

113*2 (39) 

113*0 (76) 

Bi-auricular B 

120-4 (33) 

125*7 (36) 

[123"3 (37)]+ 

124*5 (73) 

//' 

12S-0 (33) 

131*8 (37) 

J30-8 (39) 

131*3(76) 

LB 

95-8 (33) 

101-5 (30) 

99*6 (39) 

100*5 (75) 

S 

303*5 (33) 

374*4 (37) 

375-2 (39) 

374*8 (76) 

Sx 

125*3 (33) 

128-8 (37) 

129*5 (39) 

129*2 (76) 


122-9 (33) 

127*6 (37) 

126*0 (39) 

126*8 (76) 


115-3 (33) 

118-3 (37) 

119-8 (39) 

119*1 (76) 

Broca’s Q' 

299-1 (31) 

307-1 (37) 

— 

307-1 (37) 

(7 label la U 

510-1 (33) 

527*3 (37) 


527-3 (37) 

fml 

34*0 (33) 

35*5 (37) 

36-4 (39) 

36*0 (76) 

fmh 

29-2 (33) 

29*7 (37) 

30-9 (39) 

. 30-3 (76) 

(I'll 

06-7 (19) 

70*3 (30) 

70-7 (31) 

70-5 (61) 

UL 

90*4 (19) 

93*3 (30) 

90-7 (37) 

91-9 (67) 

NIL' 

49*2 (33) 

51*1 (37) 

61-9 (37) 

61-5 (74) 

XB 

22-9 (31) 

22*7 (37) 

23-2 (35) 

22-9 (72) 

DC 

20*2 (33) 

20*9 (37) 

21-1 (37) 

21-0 (74) 


38-3 (32) 

38-3 (37) 

39-1 (35) B 

38-7 (72) 

°J 

33-5 (33) 
122-4 (32) 

32*6 (37) 
129*4(31) 

33-6 (35) 11 
128-8 (34) 

33-1 (72) 
129-1 (65) 

OB 

8(5-8 (33) 

89*1 (34) 

90-1 (31) 

89-6 (65) 

External bi-orb. B 

98-3 (33) 

101*5 (37) 

— 

101 -5 (37) 

Bi-jugal B 

104-7 (29) 

109*3 (34) 

- 

109-3 (34) 

Basio-palatal L 

42-6 (31) 

44*0 (37) 

[41-4 (36)]+ 

42-7 (73) 

100 BjL 

77-0 (33) 

76*7 (37) 

77-2 (39) 

77-0 (76) 

100//'//, 

100 Bill’ 

09-9 (33) 

70*9 (37) 

70-5 (39) 

70-7 (76) 

111-3 (33) 

108*4 (37) 

109-9 (39) 

109-2 (70) 

i 

8 

7-7 (33) 

5*9 (37) 

6-8 (39) 

6-4 (76) 

100 fmh If ml 
lOGO'/l/GB 

100 NB/NH' 

100 ajof 

84-3 (33) 

86*6 (37) 

86-1 (39) 

85-8 (76) 

76-9(19) 

78*9 (30) 

78-9 (26) 

78-9 (56) 

46-6 (31) 

44*3 (37) 

44-8 (35) 

44-5 (72) 

87-3 (32) 

85*3 (37) 

86-3 (34) B 

85-8 (71) 

100 GH\J 

55-0 (19) 

54*5 (25) 

55-2 (28) 

54-9 (53) 

Nl 

64”-9 (19) 

63” *0 (30) 

or-9 (31) 

62°"4 (61) 

A L 

73°-3 (19) 

74°*9 (30) 

75°-0 (31) 

75“-0 (61) 

Bl 

41°-8 (19) 

42"*1 (30) 

43°-l (31) 

42°-6 (61) 

Daubenton’s L 

-4°-6 (33) 

—1°*3 (37) 

[ + 0°-2 (36)]+ 

- 0°-6 (73) 


* The measurements are defined in Appendix I. 
t From type contour. 

The mean measurements of the male Basque series preserved at Paris and 
Madrid are given in Table I and it will be seen that all the differences between 
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corresponding characters are remarkably small. Actually no difference is significant, 
and there is complete justification for considering that the two samples were drawn 
from the same homogeneous population. The pooled means given in the same table 
are based on numbers of individuals which are adequate enough to provide a reliable 
description of the type. They indicate that the Basque skull is typically European 
in all respects: there is nothing to suggest that it is more closely related to any 
non-European types than any other Western European forms are. The characters 
of the calvaria are in no way peculiar. It has been suggested that the occipital 
index (defined in Appendix I) has lower values for European races than for any 
others in the world. The means available range from 58*0 to 68*3*. In this respect 
the Basques are precisely similar to neighbouring races. All means shown below 
are based on more than 30 male crania. It is interesting to find that this character 
makes a fairly definite distinction between the races of Western Europe (whether 
dolicho- or brachycephalic) on the one hand, and the Egyptian and modern races of 
Eastern Europe on the other. 


Oe. I. Cranial Series 


58— 51) Seventeenth ceutury English (Farringdon Street) 1 58*0, Walser (Vorurlberg)* 58i, 

Anglo-Saxons 58'2 3 , Basques 58*3, Bavarians (Alps) 4 590. 

59 — 60 Reihengraber 6 59’5. 

60— 61 Predynastic Egyptians (Naqada A and Q) 60'2 ,J , Swiss (l)aniser) 7 60-3, Czechs 8 

Guanche® GO 6, Egyptians (XVlIIth— XXth Dynasties) 10 60*7. 

61 — 6,1 Egyptians (XXVIth—XXXth Dynasties) 11 61*5, Abyssinians (Tigre) 12 62*0. 

6?—08 Rumanians 13 02*7, Serbo-Croats 14 028, Greeks 16 62*9, Turks 10 03*3. 

I Biometnka , Vol. xvm. p. 29. 2 ZeiUchrift fiir Ethnologic , Bd. xuv. S. 509. 

8 Biometnka , Vol. xvm. p. 82. 4 “ Beitrlige z. Anthrop. u. Urgesch. Bayern* , Bd. xvm. S. 1. 

B Pooled mean. Biometrika, Vol. xxb. p. 813. 0 Biometrika , Vol. xvn. p. 15. 

7 ZeitHrhrift /. Morph, u. Anthrop. Bd. xv. Tafeln, S. 544. 

8 Archiv f. Anthropologic , Bd. xxxix. S. 282. 

9 Detloff v. Behr: Me trine he Studien an 152 Guanchemtchildeln. Stuttgart, 1908. 

10 Hermann Stahr : Die Rassenfrage im antiken Aggpten. Leipzig, 1907. 

II Biometrika, Vol. xvi. p. 837. 13 Sergio Sergi: Crania Habesemica. Rome, 1912. 

Denksch. d. k. Afoul, d. Wmemch . lVien, Math. Nat. Kl. Bd. xxx, S. 107—130. 

14 Weisbach: Supp. 1884. ZeiUchrift f. Ethnologic , S. 70—2. 

18 Mitth. d. Anthrop. Gesellsch. in Wien, Bd. xi. S. 72. 10 Ibid. Bd. hi. S. 20G—213. 

Some of the facial characters of the Basque skull are more peculiar than its 
calvarial ones. The narrowness of the face is one of the most salient features of the 
living head for this race and the same narrowness marks the cranium. The breadth 
between the lowest points on the malar-maxillary sutures (GB) is 89*6 : the reliable 
male means nearest to this value available for other European types are 90*9 for the 
Whitechapel Englishf and 91*4 for the Farringdon Street English J, while 22 other 
series have values greater than 92. The Basque nasal breadth of 22*9 is also extreme : 
for 46 other European races the lowest male means are 23*1 for both French Soldiers§ 

* Biometrika, Vol. xvi. 1924, pp. 834 and 835. 

t Ibid. Vol. m. p. 208. 

§ See references appended to Table II. 


+ Ibid. Vol. xvm. p, 28. 
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and Lowland Scottish *. The Basque bi-zygomatic breadth of 129*1 is less character¬ 
istic than OB and NB } since all Egyptian types have this measurement below 129*0, 
and European races with smaller values are Sardinians! 127*7, Portuguese} 127*7 and 
Great Russiansg 128*8. Several Western European types have male means for this 
character between 130 and 131. The facial heights of the Basque type are not out¬ 
standing, so that some of the indices expressing the ratios of these heights to the un¬ 
usual breadths may be expected to be unusual also. The upper facial index (100 Q'H/GB) 
has the extreme value of 78*9. The closest approach is shown by the Wlirtemberger 
7811|, Austrians (Vienna) 77‘5V and Farringdon Street English 77*1** * * §§ , while 22 
other European series have lower values. The Basque nasal index (100 NB/NJET) of 
44*5 is low, but not extreme, as still lower values are recorded for Belgian Franks 
43*9*, Breton Gallots 44*3tf and Portuguese! 44*4, and the Lowland Scottish* have 
the same mean of 44*5. Forty-three other European series are found to have higher 
nasal indices than these. The marked orthognathism of the Basque skull has been 
insisted upon by Aranzadi and, like the narrowness of the face, this is a striking 
characteristic of the living head. The basal length (from nasion to basion) and the 
upper facial height (from nasion to alveolar point) are not unusual, but the length 
from basion to alveolar point (OL) again appears to be extreme. The male mean is 
91*9,and the lowest means found among26 other European series are 92*3 forCzechsJt 
and 93*3 for Modern Cretans§§. The Basque nasal angle of 62°*4 is just equalled by 
one recorded for Serbo-Croats||||, while the next smallest mean is 63°*3 for the Modern 
Cretans. The Basque alveolar angle of 75 o, 0 is large but not extreme, and the basal 
angle is not peculiar. The alveolar profile angle of 88 *7 appears to be the largest 
given for any race. It is particularly interesting to find this association in the type of 
an extremely narrow face and marked orthognathism. For 48 racial series from all 
parts of the world an inter-racial correlation of + *747 + *040 has been found between 
the nasal index and nasal angle f U. For some of these facial characters the Basque type 
appears to have values which are extreme for all modern races of man; such are the 
bi-maxillary and nasal breadths, the chord from basion to alveolar point and the upper 
facial index. Other characters which are almost extreme are the occipital and nasal 
indices and the nasal and alveolar angles. These facts do not dissociate the Basques 
from other European races. The types noted above which have characters approaching 
most closely to the extremes for the Spanish race need not be presumed to be the 
ones most nearly related to it; relationship can only be estimated'with safety by 
considering all the more important features of the skull. 

* Bee references appended to Table II. f ZeiUchriftf\ Morph . und Anthrop . Bd. xm. S. 444. 

J Ferraz de Macedo: Crime et Criminel. Lisbon, 1892, p. 52. 

§ M€moire* de V Academic Imptriale des Sciences de St Piterthourg, vn* s4rie, t. xxxn. pp. 1—81. 

|| Die anthropologischen Sammluvgcn Deuttchlands. Tubingen Catalogue. 

If ZeiUchrift der k. k. Geselltchaft der Arzteiv Wien, Medizinischc Jahrbilcher, xx Jahrgang. 0. i_iv, 

Biometrika, Vol. xvni. p. 28. 

+t Jieviu d'Anthropologic, t. ii. p. 627. Archivf. Anthropologic , Bd. xxxix. S. 281, et tea . 

§§ Zeittchrift fUr Ethnologic , Bd. xliii. S. 822—325. 

Ull Weisbach: Supp. 1884. ZeiUchriftf. Ethnologic , S. 66—77. Means given, Biometrika , Vol. xxb. 
pp. 866—867. HIT Aunals of Eugenict, Vol. n. p. 886. 
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Comparison of the several Basque series with one another and with other racial 
types was made by Professor Karl Pearson's method of the coefficient of racial 
likeness* Using the standard deviations given for the Farringdon Street series of 
seventeenth century London skullsf a coefficient is found between the male means 
of the* two Basque series of 0*20 ± *19 for 23 characters. There is thus complete 
statistical justification for considering that the two samples were drawn from 
identically the same population. No single character shows a significant difference, 
so it is probable that in cases where the two workers followed the same definitions 
of measurements they interpreted them in identically the same way. The standard 
deviations of the pooled Basque scries are given in Table III below and most of 
them are rather smaller than the corresponding English values. Using the Basque 
standard deviations, the coefficient between the series measured by Aranzadi and 
Morant respectively is increased to 0*35 + *19, but it is still insignificant. 

Comparison was made with a considerable number of other European cranial 
series J and all the coefficients found which denote close resemblance are given in 
Table II. The collections at Paris and Madrid are almost exactly equal in size and 
their corresponding coefficients with the other series arc of the same order. But the 
coefficient between any series and the pooled Basques is in every case greater than 
that between the same series and either of the two smaller Basque samples. These 
differences are evidently due to the differences in the sizes of the samples and not 
to differences in degrees of relationship. To obviate this effect, each coefficient was 
reduced to the value it would have if each series in the comparison contained 100 
individuals, and these adjusted values may be compared with one another directly. 
In 8 of the 11 cases the pooled Basque series has a coefficient lying between the 
two found with its component halves. The orders in which the 3 Basque series 
arrange the 11 other types arc similar, but by no means identical. Such divergences 
must be expected when small samples are being dealt with and it would evidently 
be fallacious to attribute them in this case to differences in racial constitution. 
The comparison with the pooled series will be most reliable. It is remarkably 
similar to the British Iron Age and Lowland Scottish types, the bonds in these cases 
being quite as close as those usually found between neighbouring and contemporary 
European races. The divergence from the Pompeians and Etruscans is a little greater. 
These connections are appreciably closer than any which can be found with French 
series. Comparison is made with two Parisian populations of different dates, and 
with a series of soldiers who probably came from the north of France. The coefficients 
with these are not less than the one with seventeenth century Londoners, though 


* “On the Coefficient of Racial Likeness.” lliometrika , Vol. xvm. 1926, pp. 105—117. The form of 
the coefficient used was: 


C.R.L. asS 


1 n,n< ^ (m, ~wy)a 
M n $ + n t > <r*- 


1 + ^ ±-67449 
M 




f B G. E. Hooke: “A Third Study of the English Skull, with Special Reference to the Farringdon 
Street Crania.” Ibid. pp. 1—56. 

X Coefficients of racial likeness between the Basque series measured by the present writer and 40 
other series are given in Ibid . Vol. xx b. 1928, pp. 317—327. 
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all are of a decidedly lower order than any which can be found between the Basques 
and any central or southern French series. Comparison could not be made with 
any modern Spanish types. Among the various races, all of Asiatic origin, which 
have at one time or another been supposed closely allied to the Basques, on the 
ground of physical or other evidence, the Lapps, Finns and Egyptians may be rejected 
entirely. No close relationships have been noted with adjacent types, but all the 
most closely allied ones, apart from the Guanche, belong to Western Europe. The 
connections in these cases are quite as close as any that can be found for a number 
of other European races. Although the Basque skull possesses some unusual features, 
it is not by any means an isolated type, and there is no reason to believe that its 
origin was essentially different from that of any other racial form of skull. Its 
geographical isolation has been explained by the wildest hypotheses, but the ethnic 
history of Western Europe in recent times has been so unsettled that that isolation, 
though unusual, is not surprising. 

The standard deviations and coefficients of variation of the characters available for 
the pooled series of male Basque skulls are given in Table III (p. 76). Comparison 
is there made with the coefficients of variation of the absolute measurements, and 
with the standard deviations of indices and angles, obtained for a seventeenth 
century London series from a single graveyard * and for an Egyptian series of the 
XXVIth—XXXth Dynasties also from a single cemeteryf. The variability of the 
former series may be supposed typical of a random sample taken from a racially 
homogeneous population coming from the less isolated parts of Europe in modern 
times. For every character in the table the English constants are in excess of the 
Egyptian, although few of the differences are definitely significant. Twenty-two 
measurements provide these data for all three series. In the case of 9 of them the 
Basque variabilities are between the English and Egyptian values; for 5 the Basque 
variabilities exceed the English and for 7 they are less than the Egyptian. For the 
remaining character—the cephalic index—the Basque and Egyptian standard 
deviations are exactly equal. Judging from all the measurements, the Basque series 
may be supposed to exceed the Egyptian in variability by as much as it falls short 
of the English. There is a surprisingly small difference in this respect between 
the isolated Spanish people from a single province and the Londoners. The futility 
of the assumption often made that the variabilities for any single measurement 
can provide a reliable measure of the relative degrees of homogeneity of a number 
of series is evident from the above comparison. 

The three type contours were constructed in the usual wayj for the 37 male 
Basque skulls preserved at Paris. They are reproduced in Figs. 1—3 and the mean 
measurements used in their construction are given in Tables IV—VI. The only 

* B. G. E. Hooke: “A Third Study of the English Skull, with Special Beferenoe to the Farringdon 
Street Orania.” Diometrika , Vol. xvni. 1926, pp. 1—55. The variabilities quoted are based on numbers 
of skulls varying between 48 and 158. 

f Karl Pearson and Adelaide G. Davin: “ On the Biometric Constants of the Human Skull.” Ibid . 
Vol. xvx. 1924, pp. 826—868. The variabilities quoted are all based on more than 760 skulls. 

X See Ibid . Vol. xiv. pp. 227—244. 
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closely related series for which these contours have been published is that of the 
seventeenth century Londoners from the Farringdon Street graveyard and com- 

TABLE III. 

Constants of Variation for the Basque and other Series . Male Skulls . 



Basques 


Standard 

Deviations 


Coefficients 
of Variation 


English 

(Farringdon 

Street) 


Egyptians 

(XXVIth-XXXth 

Dynasties) 


Coefficients of Variation 


L 

B 

n 

B 


Bi-astenomc B 


6*88 ±*38 
5-02 ±‘27 

3- 90 ±*22 
5*52 ±-30 
4*75 ±*26 

4- 95 ±*27 
4-28 ±-24 

14*20 ±-78 
6 *63 ±*36 
7*62 ±*42 
6*70±*37 
4 *04 ±*25 
4-61 ± *27 
5*21 ±*31 
4*83 ±*29 
3*01 ±*17 
1*74 ± *10 
1 *61 ± *09 
2 * 00 ± *11 
2 * 10 ± *12 
2*39 ±*13 
2*32+ *13 


3*70± 

•20 

3-42 ± 

•12 

3-09 ± 

•05 

3*51 ± 

*19 

4’14± 

•17 

3*43 ± 

•05 

4*09 ± 

*22 

4*73 ± 

*18 

4-28 ± 

•07 

4-60 + 

•25 

— 


— 


4*20 ± 

•23 

— 


— 


3*77 ± 

•21 

3*90 + 

•17 

— 


4*26 ± 

•24 

4*47 ± 

•20 

3*90 ± 

•06 

3*79± 

•21 

3*76 + 

*16 

3-36 ± 

•05 

5*13 ± 

•28 

5*00 ± 

•19 

4-48 + 

•08 

6*03 ± 

•33 

6*24 ± 

•25 

5-77 ± 

•09 

5*63 ± 

•31 

6*50 ± 

*27 

5-91 + 

•10 

5*73 ± 

•35 

6*31 ± 

•33 

5*90 ± 

•10 

5-02 ± 

•29 

5*66 + 

•34 

5*10± 

•08 

4*04± 

•24 

3-69 ± 

•27 

3*55 ± 

•06 

5*39 ± 

•32 

6*74 ± 

•38 

4-90 ± 

*08 

5*85 ± 

•33 

6-75 ± 

•45 

— 


+1 

8 

i> 

•43 

8*17 ± 

•43 

7*27 ± 

•12 

4*16± 

•23 

(R) 4*20± 

•22 

— 


6*04 ± 

*34 

(/?) 6*88± 

•36 

(R) 5-67 ± 

•09 

10*00 ± 

•56 

9-73 ± 

•51 

— 


6*64 ± 

•36 

8-15± 

■36 

6*95 ± 

■11 

7*66 ± 

•42 j 

7*19± 

*33 

7‘18± 

•11 


Standard Deviations 


100 H'\L 
100 B!H' 
100 {B-H')/L 

100 Q'fflGB 
100 NB/NH' 
100 0 2 / 0 / 


2*08 ±*15 
2*96 ±*16 
5*25 ±‘29 
3*48 ±-19 
5*23 ±*29 
4*88 ±*31 
3*32± *19 
4*38 ±*24 
3°*55±*22 
3 e *56f22 
2° *83 ±*17 



3*48 ±*14 
3-24 ±'14 
6*27 ±'23 
3*59 ±‘16 
5*90 ±*27 
6*24 ± *37 

(fl)6*40±*34 

3°*68± *22 
3°*63± *22 
3°*65±*22 


2*68 ± *06 


5*79 ±*09 
4*96 ±*08 


3° *31 ± *06 
3°*46±*06 
2* *66 ± *04 


parison will be restricted to them. The two transverse types are almost identical. The 
English figure has a height ( MA) 1*3 mm. less than the Basque, but that difference, 
and the maximum found between the parallels, are not significant. It may be noted 
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that the Basque section is almost perfectly symmetrical, the maximum difference 
between the right and left sides of the same parallel being 0*9 mm. in fevoor of the 
left side. For the English section the maximum difference is 2*7 mm. in favour of 
the right side. The horizontal type contour of the Basque skull is far more asym¬ 
metrical. All the parallels on the left side exceed the corresponding ones on the 
right and the differences increase as the lines approach the occiput. The left side 
of the 7th parallel exceeds the right by 1*4 mm., for the 8th the difference is 2*5, 
for the 9th 3 8, for the 10th 4'5 and for OJ it reaches a maximum of 5*6 mm. It is 
evident from the appearance of the figure that the occipital region is quite markedly 
asymmetrical. Such a condition is more accentuated on this figure than on any other 
type contour that has yet been constructed, and usually there is no suggestion of 
it. For the English section the maximum difference between the two sides of the 
same parallel is 1 5 mm. and in this case it is the right side which is in excess. 
Some confirmation of the asymmetry noted will be needed before it will be safe to 
assert that it is a racial characteristic of the Basque skull. When the outlines for 
the two races are superposed with the point F and the axes (F0) coincident there 
is a close correspondence anterior to the 7th parallels. The maximum breadths are 
exactly equal. But the English figure exceeds the Basque in length by 3*7 mm. If 
the symmetrical English contour is then rotated about F until there is an angle of 
0°*5 between the axes the correspondence becomes appreciably better. The outlines 

TABLE IV. 


Mean Measurements of 37 Male Basque Transverse Contours . 



MA 

1 

Mi 

‘ 2 

8 

4 

5 

6 

Bj 

It ... 

111-3 

58-6 

61-4 

63-7 

06-8 

68-7* 

68-9 

68-1 


L ... 

58-6 

61*0 

64-2 

67*7 

69-4 

69-7* 

68-1 




8 

9 

10 


ZRy 

ZRx 

R ... 

61-3 

52-2 

37'7 

19-5 

61-4 

2-4 

L ... 

60-7 

52-1 

37-5 

18-8 

61-7 

3-5 


* Mean of 86 oontours. 


are now almost coincident as far back as the 8th parallel of the English type. This 
suggests that it is merely the position of the point 0 —below the lambda—which 
is asymmetrically placed on the Basque type f. 

t The writer i8 willing to admit that there may have been a constant error in marking the point 0 
on the individual contours, but there has been no suggestion of this in the oase of the other series with 
whioh he has dealt. 
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FiG.IT. HorigonW TVjjze Contour, based on 37 6 Basque {Skulls. 












TABLE Vt 

Mean Measurements of 37 Male Basque Median Sagittal Contours, 


* Ny 

Ordinates above Ny , . ’ " 

0 =N 

Ni 

1 

2 



I 

6 

7 

180-8 (37) 

19*5 (37) 

36*8 (30) 

50-7 (37) 

09*2 (37) 




79-8 (37) 

78-6 (37) 


Ordinates above Ny 

Ordinates below Ny 

8 

65*6 (37) 

9 

7l 

y\ 

Ni 

1 

2 

8 

9 

Yi 

46*3 (37) 

20*1 (37) 

14*5 (37) 

64*7 (31) 

57*1 (37) 

54-2 (37) 

52*0 (37) 

44-4 (37) 

34*2 (37) 


Ordinates 
below Ny 

Vertex 

Bregma 

Glabella 

Occipital Point 

yi 

x from N 

y 

x from N 

V 

x from N 

V 

x from y 

y 

26*0 (37) 

91*0 (37) 

81*4 (37) 

77*1 (37) 

80*3 (37) 

2*5 (37) 

8-7 (37) 

0*5 (37) 

-3*7 (37)* 


Lambda 

Sub.-Orb. Point 

Auricular Point 

Opisthion 

Inion 

x from 7 

V 

x from N 

y 

x from 7 

V 

x from 7 

y 

x from 7 

y 

5*9 (37) 

21*3 (37) 

10-1 (37) 

31*1 (37) 

95*2 (37) 

31-1 (37) 

58*8 (37) 

57*2 (37) 

15-2 (37) 

43*3 (37) 













Basion 

Alveolar Point 

Nose 


from 7 

from N 

from N 

from Bas. 

(i) 

(ii) 

(iii) 

L LNy 

NL 


106*6 (36) 

99*1 (36) 

72*4 (19) 

90*1 (19) 

1*6 (37) 

3*8 (37) 

7*6 (31) 

126°*2 ( 12 ) 

23*0 (12) 



Frontal 

Occipital 

Max. Sub. to NX 

Max. Sub. to OI 

Sp. 

Max. Sub. to Np 

Max. Sub. to X Op. 

x from N 

y 

x from X 

V 

x from N 

y 

x from O 

y 

x from N 

y 

60*2 (37) 

26*6 (37) 

52*7 (37) 

29*3 (37) 

81*2 (37) 

71*2 (37) 

100*7 (37) 

102*2 (37) 

65*3 (36) 

34*0 (36) 


Sub. from i Bas.-Sp. 
Chord 

Palate 

N. S. 

Crossing of 
Alv.-N.S. Chord 

P ' 

P 

x from Bas. 

|M 


n 

x from Alv. 

y 

x from N 

pn 

from Alv. 

12*9 (36) 

0*5 (36) 

44*9 (37) 

53*4 (37) 

32*7 (19) 

17-8 (19) 

5*6 (37) 

88-0 (87) 

8-9(19) 


The occipital point U below the Ny line, 
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Fig.HL S&gitteJ Type Contour, based on 37 <5 Basque Skulls. 
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A Contribution to Basque Craniometry 

Greater differences are found between the sagittal type contours of the two 
series compared. When they are superposed with Ny lines and the nasions coin¬ 
cident the outlines of the nasal bones and the glabella regions also coincide. The 
English skull has an extremely retreating frontal bone, but the Basque does not 
differ from it greatly in this respect. The angle fiNy is 45 0, 1 for the former and 
46°*2 for the latter. The contours cross near their vertices and the outline of the 
obelion and occipital regions is significantly more protruding for the English than 
for the Basque figure. The contours cross again near the inions, which are almost 
coincident, and from the inion to the opisthion the Basque outline recedes 
furthor from the Ny line. The lines indicating the basi-occipitals are practically 
coincident. The areas of the two calvarial sections in this plane are almost equal, 
the English being slightly the greater. The roofs of the palates are exactly equi¬ 
distant from the horizontal base line and the same is almost true for the anterior 
nasal spines and alveolar points, but there is a sensible difference between the 
prognathism of the English and Basque types. The resemblances of the three 
contours compared are so close that it can only be inferred that the two racial 
types are intimately related. The comparison of direct measurements has shown, 
however, that there are some other Western European types which appear to 
be more closely related to the Basques than the seventeenth century Londoners 
are. 

Conclusions. The principal purpose of the present paper has been to provide 
individual and mean measurements of a series of 39 male Basque skulls preserved 
in the Musee Broca. They came from Zaraus in the Spanish province of Guipuzcoa. 
Comparison with another series measured by Aranzadi of 37 male specimens from 
the same province shows that there is full justification for considering that the 
two samples were drawn randomly from the same homogeneous population. The 
variability of that population, as measured by the pooled samples, is rather less than 
for Londoners interred in a single seventeenth century graveyard, but greater than 
for a dynastic Egyptian series. The Basque skull is characterised by a peculiarly 
narrow facial skeleton—so that its nasal and upper facial indices are almost 
extreme for all races of man—and it is markedly orthognathous. These features 
do not dissociate the type from neighbouring ones. A close resemblance is found 
to several other Western European races. In spite of their present-day isolation, 
it is extremely probable that the Basques are more closely related to some existing 
or extinct races of Western Europe than to any others. 

APPENDIX I. Definitions of Measurements . 

The greater number of the following index letters denoting measurements are 
those used normally by workers in the Biometric Laboratory: Flower’s ophryo- 

occipital length. L = maximum glabella-occipital length in median sagittal plane. 
B = maximum calvarial breadth. B f — minimum frontal breadth. B " = maximum 
frontal breadth. Bi-asterionic B *= chord asterion R. to asterion L. if * Frankfurt 
vertical height from basion. H' = basio-bregmatic height. LB = nasion to basion. 
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Qf = Frankfurt vertical transverse arc from auricular point R. to auricular point L.* 
Bregmatic Q f ® transverse arc similar to Qf through bregma. Broca's transverse 
arc = arc terminating at points sus-auriculaires R. and L. and passing through 
bregma. Bi-auricular B = breadth between points sus-auriculaires R. and L. 

« median sagittal arc from nasion to opisthion. 8% m arc nasion to bregma. S 2 «= arc 
bregma to lambda. $ 8 = arc lambda to opisthion. Sf “ chord nasion to bregma. 
Sf * chord bregma to lambda. Sf = chord lambda to opisthion. U =» maximum 
horizontal circumference above superciliary ridge and through ophryon. Glabella 
U * maximum horizontal circumference through glabella. /mi=basion to opisthion. 
fmb = maximum breadth of foramen magnum . PH = tip of anterior nasal spine to 
alveolar point. 0H -nasion to alveolar point. = basion to alveolar point. 
OB = chord between lowest points on malar-maxillary sutures R. and L. J *= 
maximum bi-zygomatic breadth. External bi-orbital B «= maximum breadth 
between external surfaces of orbital processes of frontal bone. Bi-jugal B = breadth 
between Brocas points jugals R. and L. NH' = nasal height from nasion to base 
of anterior nasal spine. NH , R. and L. = Frankfurt nasal height from nasion to 
lowest point on edge of pyriform aperture R. and L, NB = maximum breadth of 
pyriform aperture. D6 V = chord dacryon R. to dacryon L. DA =arc dacryon R. to 
dacryon L. DS — minimum subtense from bridge of nose to dacryal chord. SC=* 
minimum chord between naso-maxillary sutures. SS — subtense from bridge of 
nose to simotic chord. Oj, R. and L. = maximum breadth of orbit R. and L. using 
curvature method (see Biometrika , Vol. I. p. 130 and Vol. vm. pp. 311 and 312). 
(V, R. and L. — orbital breadth from dacryon R. and L. 0 a , R. and L. = orbital 
height R. and L. whether taken perpendicular to 0 X or Oi'. (?!« palate length from 
tip of posterior nasal spine to median point on an imaginary line tangential to inner 
alveolar borders of the central incisors. Of = palate length from base of posterior 
nasal spine to same anterior terminal. Cr 2 = palate breadth between inner alveolar 
walls at second molars. EH — palate depth from 0% chord taken with Pearson's 
uraniscometer. Basio-palatal L = basion to tip of posterior nasal spine. 

Various indices are calculated from the above absolute measurements. The 

g I g 

Occipital Index ( Oc. /.), defined to be 100 -r^ ./ „ . - 3 -~— , was found with the 

aid of Tildesley’s table of this function ( Biometrika, Vol. xili. pp. 261—262). PL is 
the angle between the line joining nasion to alveolar point—not the prosthion— 
and the Frankfurt horizontal plane. NL, A L and BL are the angles of the 
triangle of which the nasion, alveolar point and basion are the apices. They were 
found from the chords 0'H> OL and LB in the manner described by Fawcett 
(Biometrika , Vol. I. p. 418) with the aid of Pearson's trigonometer. 0iis the angle 
between the line joining basion to nasion and the Frankfurt horizontal, i.e. 
180° — PL — NL. 0 a is the angle between the line joining basion to alveolar point 
and the horizontal, i.e. PL—AL, Daubenton's L is, by Broca's definition, the 
angle between the sagittal axis of the foramen magnum (i.e. the chord joining 

* These auricular points correspond to Martin’s “porionB.” 
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A Contribution to Basque Craniometry' 

basion and opisthion) and the line joining the opisthion to the meet of the t{gm' 
sous orbitaire with the median sagittal plane. It is positive if the basion falls below 
that line. 

Plates. Plates I—V show five aspects of a normal male Basque skull taken 
with the focal plane of the camera parallel or perpendicular to the Frankfurt 
horizontal plane. The specimen is No. 58 of the first (1862) series in the Mus$e 
Broca, and its measurements are given in Appendix II. The reduction is approxi¬ 
mately to eight-tenths natural size. 



Biometr&a, Vol. XXI Plate I 

Moranl, The Basque Skull 




Skull of Adult Male. Norma facialis. 
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Morant, The Basque Shull 
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Morant, The Basque Skull 


Plate III 



Skull of Adult Male. Norma verllcalis. 
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Morant, The Basque Skull 


Plate IV 



Skull of Adult Male. Norma basalls, 
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Morant, The Basque Skull 


Plate V 



Skull of Adult Male. Norma occipitalis. (The bright elliptical spots below inion 
belong to craniophor.) 






ON MEASUREMENT OF THE INTERNAL DIAMETERS OF 
THE SKULL IN RELATION: 

(I) TO THE PREDICTION OF ITS CAPACITY, 

(II) TO THE “PRE-EMINENCE” OF THE LEFT 
HEMISPHERE. 

By M. F. HOADLEY (with some assistance from K. PEARSON). 

(I) On the Prediction of Cranial Capacity from Interned and from External 
Measurements. 

(i) Introductory. It is w^ll known, at any rate to readers of BiometriJca , that 
prediction formulae for the capacity of the skull can be based on the measurement 
of external diameters or external arcs, and that within a definite race they give 
fairly satisfactory results. But the application of formulae found from the data for 
one race to the determination of the capacity of a second race is not so satisfactory 
and caution must be used in drawing inferences from even mean capacities so 
determined. The need for such formulae is considerable; it may arise: (i) from the 
condition of a series of crania being such that either from their broken or from their 
fragile condition it is not possible to obtain the cranial capacity directly; (ii) from 
a desire to ascertain cranial capacity from measurements on the living. The latter 
need can of course not be met by any proposal to replace external by internal 
measurements. The present paper arises from the consideration of whether in the 
former case we shall much improve our prediction formulae by using internal 
instead of external measurements. This involves a consideration of the further 
question of whether the gain is so considerable that it more than compensates for 
the increased labour of taking the additional measurements. 

After a consideration of various suggested instruments for measuring internal 
lengths of the cranium by insertion through the foramen magnum •, we saw in the 
Anthropologischer Anzeiger.Iahvg. ir, S. 129—31, an article by Dr Hans Weinert 
entitled “Ein neuer Messzirkel zur Ermittelung von Innenmassen,” and we con¬ 
cluded that it was easier to experiment with an already constructed instrument 
than to experiment in constructing one. Accordingly we procured from Messrs 
Alig and Baumgartel, of Aschaftenburg, one of Dr Weinert’s instruments. No 
directions for the use of the instrument are provided with it or in the above 
paper, only in the latter we find a diagram illustrating what is apparently 
the median sagittal plane of the skull with the instrument set for measuring 
the length. In this diagram the maximum foraminal length is almost as great 

* Inolading an ingenious model by Sir W. Flinders Petrie. 
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as the distance from the basion to the tip of the dorsum sellae; with such a 
magnitude of the foramen it is possible to place the hinge of the measuring 
circle (Weinert’s callipers) entirely within the foramen as in the diagram* 
With crania with more moderate foramina we have not found this possible, 
and greater difficulty may then arise in taking the internal measurements; it is 
less easy to grope for the maximum diameter. It is ungracious to criticise an 
instrument which has been designed and constructed with considerable thought 
and care, but if it be feasible to reduce the size of the hinge and of the arms in the 
immediate neighbourhood of the hinge, we believe it would be a distinct advantage. 
For taking the internal height of the skull this instrument was not used. The skull 
in norma basalis was adjusted to the Frankfurt plane and the height (Hi) from 
basion to inner table then measured by a vertical rod. The maximum internal 
breadth (2?*) was ascertained by aid of Weinert’s callipers. We found it required some 
experience for the same person to repeat closely the same measurement, but after 
practise this became easy. Several measurements were taken, and if these were in 
close agreement the largest was adopted as the maximum internal breadth; if the 
measurements were not in close agreement the measurer started afresh. The 
greatest care was taken to maintain the line between the measuring points of the 
callipers horizontal and perpendicular to the median sagittal plane; the skull rested 
during the measurement with foramen upwards in the standard position. After 
considerable practise the taking of breadths internally seemed to have reached a 
satisfactory degree of accuracy, but no craniometrician could, we believe, purchase 
Weinert’s callipers, and straight off, without a large amount of experimenting, hope 
to obtain exact results even for the internal breadth. The handling of the callipers 
requires to be patiently learnt. 

Now the object of measuring the internal diameters of the brain box is to obtain 
a system of measurements more nearly representing the cavity itself than can be 
found when we include (by using external diameters) the thickness of the bone and 
that of the frontal sinus. Dr Weinert has applied his callipers to determine the 
internal diameters, in particular the length of the cranium, in numerous specimens 
of mammals and in particular of the primates including man*, but we cannot find 
that he has given any very detailed account of how the length is to be taken. On 
pp. 354—350 we have much the same account of his callipers as in the Anthropo- 
logi$ch#r Anzeiger . The diagrams, p. 344, seem to suggest that he takes it in the 
sagittal plane. Now it seems to us that the objections to this are twofold, one 
arising from the purpose to be obtained by the measurement, and the other from 
the difficulty of manipulating the instrument. The reader who will examine a sec¬ 
tioned skull will find that the median sagittal section is ridged to a greater or less 
extent both anteriorly and posteriorly; we have the crest for the attachment of the 
falx cerebri, the crista galli of the ethmoid, and the internal occipital protuberance 
and the sagittal ridge associated with it. These ridges are difficult to feel with the 
tip of the callipers, and if one tip has been adjusted to the anterior ridge, it will 

* See his memoir 11 Die Ausbildang der Stirnhdhlen als stammesgeschiohtliohes Merkmal,” Zeit * 
tchnft filr Morphologic und Anthropologic , Bd. xxv. S. 348—857, 865—418. 
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be almost sure to slip off while the other tip is being adjusted to the posterior 
ridge; we found it practically impossible to determine satisfactorily, by groping 
about in the unseen cavity of the skull, the maximum distanoe between two 
ridges of this kind. We do not believe that the maximum internal diameter in the 
sagittal plane can be satisfactorily determined, at least with callipers pointed as in 
Weinert’s instrument; to fix the reading face of the callipers on the ridge a flat 
surface to the tip is needful. But we hold that if the ridges could be obtained by 
successful groping and the maximum distance between them ascertained, we should 
be reducing the true cranial maximum distance by bony excrescences in precisely 
the same manner as the external length is fictitiously magnified by the thickness 
of bone and the frontal sinus. We therefore rejected after trial the attempt to 
obtain an internal diameter in the sagittal plane. A slight examination of the brain 
box shows that the maximum longitudinal diameters of the cavity lie right and 
left of the anterior and posterior median ridges; these are relatively easy to ascertain 
and accordingly we took as our maximum internal length (Li) the mean of the two 
internal maximum lengths measured right and left of the median ridges and 
parallel to the median sagittal plane. The success of this approach to the problem 
will be appreciated when we say that Li thus found had the high correlation of 
*8193 with the cranial capacity as found by seed in the usual manner of the 
Biometric Laboratory. 

(ii) Material Selected . The material chosen for this investigation consisted of 
729 adult male skulls of the long 26th—30th Dynasties Egyptian series in the 
Biometric Laboratory. The necessary external measurements, viz. L = glabellar- 
occipital length, B = maximum parietal breadth, H = basion to point vertically above 
it with skull adjusted on craniophor to Frankfurt horizontal, and also (7= capacity 
obtained by packing skull tightly with mustard seed and then weighing, were of 
course already available. The three internal measurements Li, Bi and Hi were 
taken as described in the above introductory note. 

Twenty correlation tables were formed between the various measurements. It 
was then possible to obtain all the other twenty-seven correlations by aid of formulae 
involving the correlations already found and the known standard deviations. This 
simplification arises, because we are dealing with various differences in the thickness 
of the skull found by subtracting an internal from an external measurement. The 
correlations r L%Li and r BtB „ Bi were found both by forming tables and by formulae, 
in order to estimate the degree of divergence when such formulae are used instead 
of forming tables. We have 

r i,L% ■* ’78042 by table 
= *78169 by formula. 

-*12761 by table 
= — *12831 by formula. 

These differences are due to the grouping in the tables and are of no importance 
for our present purpose. 
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The formulae used for calculating these indirectly deduced correlations were 
the following: 

~ a\-*j 




( 1 ) 7 Of, 

From this were found r LtLi> r BtBi and 

( 2 ) 


2a (e a SH 


<r*- 


• x, x—x % - 


'XyZX • <T*i 


'X—tCi 


From this were found r L>L __ Ij%) r and the four corresponding correlations for 
breadth and height. 

~ _ a v r *, v — a vi r ^yi 

\°) . 

°v-i ti 

This gave r CtL ~ Li and two corresponding ones r CtB „ B{ , r 0tB - E{ , and further 
r L f B-~m> r u* b-B{> r L,H~Hi> r Li,H-Hi> y-y< and the eight corresponding correlations 

(4) r = ax * av i ror i' y * (J% °V r *V Z <J% ayiT% ' Vi ~~ (7x ~ x < ^V-ViV~V i 

G %i • G y 

This provides r H%tB , r^ (H and r UB . 

Table I gives the means, standard deviations and coefficients of variation. The 
internal length is considerably more variable than the external length while in the 


TABLE I. 



Mean 

S.D. 

V 

c 

1440*30 4- 2*84 

113*67 ±2*01 

7*89+ *14 

L 

185*53 + -14 

5*63+ *10 

3*04+*05 

A 

170-44+ *14 

5*77 ± *10 

3*39+06 

a 

139-16+ *11 

4-65+ -08 

3*34+ *06 

A 

132-14+ -11 

4*41+ *08 

3*33+*06 

// 

134*31+ *13 

5-02+ *09 

3*74+*07 

/A 

128-85+ *12 

4*86 + *07 

3*77 ±*07 

I--Li 

15*10± *09 

3*77 ± *07 

24*95 ±*47 

li-Bi 

7 *03 ± *05 

2*15 ± *04 

30*58+*59 

11-U , 

5*46± *04 

1*75+ *03 

32*23+ *63 

/’ 

3472-91+5*67 

266*98 + 4*71 

7*69+*14 

1\ 

2900*52 + 5*82 

232*96+4*11 

8*02+*14 

a,* 

171*04+ *15 

6*96± *11 

3*48±*06 * 

Li* 

170*05 ± *14 

6*69 ± *10 

3*35+*00 


case of the breadths and heights the difference in the coefficients is negligible, 
being well within the probable error. Consequently the internal product is slightly 
more variable than the external one. 

The correlations are given in Table II and are classified in Table III (p. 91). There 
is no significant difference in the correlations r CtB and r CtBi but r 0tL{ shows a con¬ 
siderable increase on r c%L while r CtBi is slightly greater than r 0|J8 . The total result 
of these values is to produce a correlation between capacity and internal product 
# From second series of measurements. L 4 =J (L r +L l ) in this table is from the first series. 
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TABLE II. 

Correlations. 



L 

Li 

B 

Bi 

H 


L-Li 


H-Hi 

P 

p. 

c 

•67246 

±•0137 


■67563 

±•0136 

•72414 

±•0119 


■53434 
± '0178 






L 

— 

•78042 

±•0098 

•41280 

±•0207 

•38576 
± *0213 

•32110 
± '0224 

*26979 
± *0232 

•29586 

±•0228 


•17102 

±•0243 

— 

— 

Li 

•78042 

±•0098 

■ 

•42861 

±•0204 

•46530 

±*0196 

•46709 

±•0195 

•45800 
± *0197 

- *36448 
±•0217 

-•02667 

±•0250 

•06730 

±•0249 

— 

— 

B 

•41280 

±•0207 

•42861 

±*0204 

— 

*88827 

±•0053 

•33392 

±•0222 

•28140 

±•0230 

- -04011 
±•0249 

•34156 

±•0221 

•17553 
± -0242 

- 

— 

B ( 

•38576 

±•0213 

•46530 

±•0196 

•88827 

±•0053 

- 

•27035 

±*0232 

•24149 

±•0235 

-•13668 

±•0245 

-•12761 

±•0246 

•10429 

±•0247 

- 

— 

H 

•32110 

±•0224 

•46709 

±•0195 

•33392 

±•0222 

•27035 

±•0232 

— 

•93734 j 
±•0030 

-•23597 
± *0236 

•16779 
+ •0243 

•26368 
± '0232 

— 

— 

LTi 

•26979 

±•0232 

•45800 
± -0197 

•28140 

±•0230 

•24149 

±•0235 

•93734 

±•0030 

— 

- ’29866 
±•0228 

•11324 

±*0247 

- *08892 
±•0248 

— 

- 

L-L x 

•29586 
± ’0228 

- *36448 
±•0217 

-■04011 

±•0249 

- -mee 

± -0245 

- -23597 
± '0236 

- *29866 
± *0228 

— 

•19306 

±•0241 

•15225 

±•0244 

i _ 

— 


•10195 
± *0247 

-•02667 

±•0250 

•34156 

±•0221 

- *12761 
±•0246 

•16779 

±•0243 

•11324 

±•0247 

•19306 
± *0241 

— 

•16618 

±•0243 

— 

- 

H-ffi 

•17102 
± *0243 

•06730 

±•0249 

•17553 
± -0242 

•10429 
± -0247 

•26368 

±•0232 

-•08892 

±•0248 

•15225 

±•0244 

•16618 

±•0243 

— 

— 

— 


of - 89586 as compared with 82081 for external product and capacity. The corre¬ 
lations r liLi , r BiBi , r BiB( are all high, H and Hi being the most closely correlated 
and L and the least. 

It must be remembered that whereas H - Hi is a measure of the thickness of 
the bone at the top of the skull, B-B ( and L-L t are respectively twice the 
thickness of the parietal bone aud the thickness of the frontal plus the occipital 
bones. The mean thickness on the parietal bone where the skull has greatest hori¬ 
zontal breadth is thus 8 52 approximately. 

On the few divided male crania in the Laboratory these values for the bone 
thickness were by no means unreasonable, especially considering the light build of 
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the Egyptian skull. Of the 15 mm. thickness of bone in the cranial length about 
$ may be taken to be frontal and £ occipital. 

The correlations r LtL _ Li , r B>B „ Bi , are all positive and lie between *25 

and '35 indicating that the thickness increases significantly but not rapidly with 
increase of the external measurement. On the other hand, it is interesting to note 
that r Li ' L - Li , r Blt ji-. Bv are all negative, but the only one of importance is 

the first which is — *36. 

We conclude from the first of these results that a large skull has on the average 
a large thickness of bone, and that a large brain cavity has on the average a small 
bone thickness. It will be seen that these conclusions are what we should expect 
as an outcome of ‘‘spurious correlation,” if the internal and external measurements 
were really uncorrelated. 

Next turning to the three thicknesses X —X t -, jB — B{ and H — Hi we see that 
while only very moderately correlated with each other, the correlations are all 
positive, or a thick brain case tends to be thick in all directions. Thickness of bone 
in the length is more highly correlated with that of the breadth, i.e. on the parietals, 
than with that of the height, i.e. at the apex of the skull; and the thickness at the 
apex is slightly but not significantly more correlated with that on the parietals 
than with the sum of the thicknesses on the frontal and occipital bones. As a 
matter of fact none of these bone thickness correlations are really significantly 
different, and it would suffice to say that the thicknesses of the skull in those places 
tested have with one another the very moderate correlation of about ‘17. From 
this it is clear that there is a great deal of independence about the thickness of 
cranial bone at different points. Turning to the influence of bone thickness on 
cranial capacity, we see that the thickness on the parietals has no sensible influence 
on the capacity; that at the apex has a very slight positive effect, or if a skull has 
a thick crown the capacity might be expected to be very minutely larger. 

It will be seen that the highest correlations are between corresponding external 
and internal lengths, these correlations being in the descending order heights, 
breadths, lengths*. The correlations of the length, breadth and height thicknesses 
with the external diameters are on the whole so small that there is little hope of 
obtaining any close estimate of the thicknesses from these external measurements. 
To this point we shall return later. As a rule an external measurement is more highly 
correlated with another external measurement than with the corresponding internal 
measurement, but the external breadth and height are more closely associated with 
the internal than with the external length; this is markedly the case for the external 
height. In the case of the internal measurements they also are not always more 
highly correlated with each other than with the corresponding external measure¬ 
ments. Thus Li is more highly correlated with B{ than with B , but is slightly more 
correlated with H than Bi. B{ again is more highly correlated with X< than with 
X, but slightly more correlated with H than Hi. Finally if< is far more highly 
correlated with X t - than with X, but somewhat less correlated with 1 than 1 B. 

* This order is precisely that of the thiokness of bone in these direction*. 
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The first line of Table II indicates the advantages to be obtained by using 
internal rather than external measurements. We note that the correlations of the 
internal and external heights with capacity are nearly equal; what advantage there 
is being in favour of the external height. But the internal breadth with capacity 
exceeds by 7*2 °/„ the external breadth with capacity correlation, while that of 
capacity with internal length is 21*8 °/ 0 better than that with external length. 
It thus appears clear that the internal measurements, difficult as they are to 
make, will give in the case of the length considerably and in case of the breadth 
slightly better results than the corresponding external measurements. Accordingly 
the continuous products P for the external and Pi for the internal diameters were 
formed and the regression equations of the capacity C on these determined, with 


the following results for probable capacity C: 

6 * 0003495 P + 226*52 ± 44/n/ti .(a), 

C s= *000437 2 P* + 1 69*57 ± 34 /Vn .(5). 


It will be seen that the probable error of the estimate is reduced from 44/Vw to 
34/Vn or 22*7 °/ G by using internal instead of external diameters. The correlation 
between capacity and continuous product was raised from 82081 to *89586, the 
chief factor here being undoubtedly the change from L to Li. 

Now this increased accuracy of prediction is of great importance, but it involves 
serious consequences, for undoubtedly the determination of the internal measure¬ 
ments is a difficult task and not to be lightly undertaken. Accordingly attempts 
were made to determine L it B { and Hi from L } B and H, and then to use (6). 

The regression formulae are: 

O.QK11Q 

= *69527 L + *09155 B + *25867 H - 6*04011 ± 

vw 

Bi = -02423 L + -84103 B - -03156 H + 14-81886 ± 1 ' 3 '’ 994 . ,( c ). 

sin 


Ht = — 02217 L - -02797 B+ -92439 H+ 12-70355 ± 113385 

Vtt 

A first sample of 20 skulls was taken, but not at random, being distributed with 
rough uniformity over the whole range of capacity, rather than at random over the 
distribution of frequency. The results were as follows: 


Table of *67449 x Square Root Mean Square Residual. 
1st Sample , 20 crania. 


(i) Internal Product from ( b ) . 

(ii) I nternal Diameters obtained by sub-1 

tracting mean thicknesses from V 
external diameters and using (6) J 

(iii) 1^ found from (c), and B { and //< byl 

subtracting mean thicknesses J 

(iv) External Product from (a) . 

(v) L if Bi, Bi all found from (c) andl 

then (6) used J 


39*69 cm. 3 , expected 34 cm. 3 
42*15 „ 

45*75 „ 

48*33 „ expected 44 cm. 3 
52*02 „ 
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(iii) was undertaken with a view of reducing the labour of (v), as is not better 
than JET, and Bi is only slightly better than B. The result is somewhat anomalous. 
After actual internal measurement of the three diameters, we obtain the olosest 
estimate by merely subtracting the average thicknesses from the corresponding 
external measurements. The next best result is when we obtain only from the 
regression formulae (c), and B { and Hi by subtracting mean thicknesses. The worst 
result—worse than using the external diametral product with (a)—arises from 
finding Lu Bi and Hi from the regression formulae (c). This is of course paradoxical, 
but the source of the paradox is to be sought, we think, in the sample, which had 
emphasised too much the skulls with very small and very large capacities, and it is 
usually towards the tails that the linear regression equations prove less satisfoctoiy. 
It will be noted that the values obtained by both (a) and (6) are considerably in 
excess of the expected. 

It seemed advisable accordingly to select a second sample, this time of 24 skulls, 
choosing them at random from the frequency, actually by taking out every thirtieth 
card from the pack of data cards. 


Table of *67449 x Square Root Mean Square Residual . 
2nd Sample , 24 crania. 


(i) 

(ii) 

(iii) 

(iv) 


(v) 


(vi) 


From Internal Diametral Product. 

From values of L { , Hi from (c). 

From External Diametral Product. 

By subtracting mean thicknesses from! 

external diamoters J 

By subtracting mean thicknesses to obtain^ 
Bi and Ht and determining from L !■ 

and H+ J 

By subtracting mean thicknesses to obtain 'j 
Bi and and determining L { from L^B\ 
and U J 


33*27 cm. 3 , expected 34 cm. 3 
42*82 cm. 3 

43*32 cm 3 , expected 44 cm. 3 
44*97 cm. 3 

45*84 cm 3 


45 93 cm. 3 


The expected result from external measurements is 44 cm. 8 ; the sample gives 
43. The best result obtained by endeavouring to estimate the internal from the 
external diameters is provided by (ii), practically 43 also. All the other values are 
in excess of 44, and none in any way approaches the internal diametral product 
value of 33. Accordingly we do not seem able to predict with any serviceable degree 
of accuracy Li, Bi and H { from L, B and H with a view to using (6). 

The general conclusions of this section of our paper are: (i) that internal 
measurements have an accuracy about 23 % greater than external measurements; 
(ii) that the thicknesses of the bone are so slightly correlated with the external 
measurements that it is not possible to obtain from the latter good approximations 
to their values; and (iii) that we must either be contented with the degree of accuracy 
provided by the external diametral product, or if we wish to improve on that accuracy 
we must practise the rather difficult technique of internal measurement. 

* The requisite regression formula is I*=*722492 L + '977208 H - *84808. 
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If the process of internal measurement has only to be applied occasionally to 
isolated skulls, we are inclined to doubt whether the technique is worth learning 
and facility in it maintained for the case of such increased accuracy, and this con¬ 
clusion may be emphasised by the fact that such isolated skulls may not belong 
to the same race or even to a race allied to the race (or possibly one or two races) 
for which the prediction formula for internal measurements may have been ascer¬ 
tained. 

(II) On the Pre-eminence of one or other Cerebral Hemisphere . 

(i) Introductory. Aretaeus, a Greek physician of the first century of our era, 
was probably the first observer who has recorded the decussation of the pyramidal 
tracts, those from the left side passing to the right hemisphere and those from the 
right side to the left hemisphere*. But he did not apparently note that injuries 
to the left side of the brain were accompanied not only by paralysis of the right side, 
but often by loss of speech. Though there may have been isolated cases previously 
noted, Dax in 1865 appears to have been the first who emphasised the view that 
injury to the left side of the brain was often accompanied by aphasia. The associa¬ 
tion of aphasia with paralysis of the right side as a result of injury to the left side 
of the brain has accordingly been spoken of as Dax’s Lawf. On the basis of this 
law the faculty of speech was associated with the left hemisphere and taken in con¬ 
junction with right-handedness it became customary to speak of a “pre-eminence” 
of the left hemisphere. There has, as far as we are aware, been no exact definition 
of this “pre-eminence” as to whether it is to bo sought for in sensation, perception, 
volition, etc., but it was supposed that if it existed it would certainly manifest itself 
in physical characters, which could be measured or appreciated. And search was 
made for somewhat gross physical differences between the right and left sides of the 
brain, in particular between the right and left cerebral hemispheres, at first in 
weight or volume and later in the depth of furrows and complexity of folds. The 
investigation was rendered exceedingly difficult owing to the task of dividing the 
hemispheres from one another, to the fact that some of the brains may have belonged 
to sinistral not dextral individuals, and that even if the ante-mortem laterality of 
the individual had been ascertained the factor of lateral educability could be called 

* In his Book v, Chapter vii “On Palsy,” he writes: 

“Should any part below the head begin to be affected suoh as the membrane enclosing the spiral 
marrow, then parts which are synonymous and connected suffer from the resolution, viz. those on the 
right from an affeotion of the right side,... But if the head is first affected on the right side, the nerves 
on the left suffer, and again the nerves on the right from a resolution taking place on the left, which iB 
owing to a change in the oourse of the nerves, for those that begin from the right do not run in a straight 
line on the same Bide to their extremities, but immediately after their origin or rise paBS to the opposite 
side, interchanging one with another like the letter X.” Aretaeus , consisting of eight Books on the Causes , 
Symptoms and Cure of acute and chronic Diseases; translated from the original Creek . By John Moffat, 
M.D., London. 

f “Lesions de la moiti5 gauche de l’eno^phale coincident avec l’oubli des signes de la Pens4e,” 
Gazette hehdomadaire de mtdecine et de chirurgie t T. n. S6rie 2, pp. 259—262, Paris, 1665. Dax collected 
871 observations, and in 87 oases a lesion of the left hemisphere ooinoided with a lesion of the faculty 
of speech; 58 cases of lesion of the right hemisphere were accompanied by conservation of this faculty. 
Six cases appeared to contradict the law and 225 oases provided no information one way or the other. 
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into play to account for apparently exceptional cases. Added to all this there is the 
question of errors of observation, and in none of the investigations we have come 
across, although the errors of measurement were admitted to be large, was there any 
attempt to distinguish them from the errors of random sampling. Various writers 
decided not to pay attention to differences in weight, for example, under 3 grs. or 
under 10 grs., as they imagined the order of their errors of measurement to be lower 
or higher. Considering the amount of fluid in the brain, and how it may drain off 
when the hemispheres are separated, it is quite conceivable that if an anatomist 
had the habit of measuring the left hemisphere before the right, he would find the 
former the heavier*. The difficulties of hemisphere weighing are so great that it 
is little wonder that the “ pre-eminence of the left hemisphere/’ if it be associated 
with greater weight, has remained unproven. 

The principle of the “ pre-eminence ” of the left hemisphere was carried far in 
England by Boyd and Ogle, and in France by Broca, and seems to have become 
almost dogmatic until quite recent times f. 

R. Boyd published in the Phil . Trans, a long series of brain weights in 1861J. 
Unfortunately he did not provide the individual values, but only the means in 
certain age groups. His Table II (pp. 254—262) contains the results of measuring 
the weights of 295 male and 233 female patients of the Somerset County Lunatic 
Asylum. In the case of 290 of the former and 229 of the latter the weights of the 
right and left cerebal hemispheres are given separately. Unfortunately no details 
are provided of the methods of measurement nor are we told how the hemispheres 
were divided. 

The results in ozs. are as follows: 

Mean Weights of Ilemisphei'es. 


Ages in 
years 

Men 

Women 

No. 

Bight 

Left 

No. 

Bight 

Left 

Under SO 

44 

20*89 

21*05 

30 

19*21 

19-51 

so—40 

61 

19-82 

19-94 

46 

18-63 

18-84 

40—50 

76 

19*49 

19-67 

48 

18-05 

18*24 

50—GO 

42 

20-44 

20-73 

39 

18-66 

18-75 

GO—70 

39 

20-66 

20-86 

41 

18-37 

18-63 

70—80 

20 

20-25 

20*47 

20 

17*97 

18-09 

Over 80 

8 

18-97 

18-62 

5 

17-20 

1 

17-39 


* 100 grs. of fluid may pass out of an extracted brain and it oan lose 5 grs. by evaporation per two 
hours. 

t See for example Byron Bramwell in the Lancet , 1899, Vol. i. pp. 147S—9. “In perfectly healthy 
right-handed persons who do not inherit a tendency to left-handedness, the driving or leading speech 
centres are (with perhaps rare exceptions, but I know of no recorded cases which definitely prove thiB) 
situated in the left hemisphere of the brain; and vice versa in left-handed persons the leading or driving 
speech centres are so far as we know usually but probably less constantly situated in the right 
hemisphere." 

t Vol. oli. pp. 241—262. 
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In all fourteen classes, except that of men over 80 years of age, the mean weight 
of the left hemisphere exceeds that of the right. If we exclude individuals over 80 
years of age, we have 



Mean for Men 

Mean for Women 

No. 

Eight 

Left 

No. 

Bight 

Left 

028. ... 
grs. ... 

| 282 | 

20*137 

570*55 

20-322 

576*79 

j- 224 | 

18*482 

523*66 

18*662 

528*76 


These numbers might be considered by some as fairly conclusive, although no 
probable errors arc given. But when we find directly opposed results given by later 
writers, wc wish that some details of his procedure had been provided by Boyd. 
Could he possibly have measured his left hemisphere usually before the right? 

Wagner, in a work* published in the year following Boyd's, found in the case, 
however, of only 18 brains: Right hemisphere 427 grs., Left hemisphere 426 grs., 
the latter having a slightly less mean weight; but to judge by his average weights 
his method must have been wholly different from Boyd’s. Thurnam in 1866 mea¬ 
sured the brains of 257 males and 218 females and in all but two cases the cerebral 
hemispheres apartf. He found for the mean male right hemisphere 670*63 grs.—a 
value extraordinarily close to Boyd’s 570*55 grs.—but his value for the male left 
hemisphere was only 569*78 grs., slightly less than that for the right, but really 
insignificantly different; the excess of right, however, was present in six out of eight 
age groups. For the females the mean right hemisphere weighed 511*13 grs. and 
the left again less—510*85 grs. In the age groups two means were equal, three in 
excess for the right and three in excess for the left. Thus Thurnam’s results by no 
means confirmed Boyd’s, but seemed to indicate if anything equality in the hemi¬ 
spheres as far as weight was concerned. 

Broca appears very early to have cast in his lot with the supporters of the pre¬ 
eminence of the left hemisphere. Broca’s actual measurements have we believe 
never been published and we owe the account of them to his pupil TopinardJ. 
Broca is said to have weighed 264 male and 139 female brains, and to have found 
the mean of the right hemisphere greater than that of the left by 1*93 grs. in men 
and 0*03 gr. in women. Thus Broca’s results seem to accord better with those of 
Thurnam than with those of Boyd. The fact which Broca states—that on 19 out of 
20 occasions the lesions which produce aphasia are on the left—led him to insist that 
the left side of the brain is that which works by preference. Broca added that manual 
dextrality in virtue of 

Pentre-croisement des faieceaux mcdullaires dans la raoelle allonges est une autre preuve du 
fonctionnement plus ordinaire du cerveau gauche. 

* Vorstudien dee menschlichen Gehime % 1862, Bd. u. §§89—92. 
t Journal of Mental Science , April, 1866, 
t J&limentt d'Anthropologic ginirale , Paris, 1885, p. 581. 
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Not daunted by the failure to show the left hemisphere the heavier, Broca pro¬ 
ceeded to the harder task of separating the lobes of the hemispheres and weighing 
each separately. He divided into frontal, temporo-parietal and occipital lobes, and 
found the following results for mean values: 

Weights of Men (358) Women (136) ' 

Bight - Left Frontal Lobes -2 50 grs. -1*50 grs. 

n n Temporo-parietal Lobes +1*02 grs. + *80 gr. 

„ „ Occipital Loboe +1*57 grs, + *03 gr. 

Thus according to Broca the male frontal lobe is heavier on the l$ft by 2*50 grs* 
and the remainder of the hemisphere heavier on the right by 8*49 grs. We cannot test 
the significance of these results, because no probable errors can be found without 
the individual measurements. Topinard says that Broca’s register indicates that the 
weights of the frontal lobes were equal in 12 cases, the left in excess in 186 and 
the right in 94, the total of which, 242, does not accord with the 258 male cases in 
which we are told the frontal lobe was weighed. Topinard (p. 584) tells us that 
with these extracts from Broca’s register he has confirmed the results drawn from 
the two series, one from Bicfitre consisting of 19 subjects and the other from Saint- 
Antoine of 18 subjects actually published by Broca, for therein Broca found the right 
hemisphere somewhat heavier than the left as a whole, but the left frontal lobe in 
excess by “une quantity tr&s notable *' Topinard gives no figures and the numbers 
were far from adequate to base any sound conclusions on*. Topinard, however, 
# The data appear in the Bulletin* de la SociM d'Anthropologic , 2 stale, 1875, pp. 584—6, in a paper 
under Broca’s name entitled: “Sur lee poids relatifs des deux hemispheres otaCbraux et de leurs lobes 
frontaux.” The following results are given* 

Hemisphere Frontal Lobe 



Hospioe de Bicfitre (mean age 62 years a months) 

19 males. 581*81 580*84 227*57 282*10 

HOpital Saint-Antoine (mean age 50 years 1 month) 

18 males: 575*88 574*89 245*05 248 50 

In this paper Broca says that since 1861, when he recognised that the faculty of language is localised 
in the third frontal convolution of the left hemisphere, he had weighed separately the two hemispheres 
and their principal components in all the autopsies he had made at the hospitals. He had made, he 
said, 440 detailed observations, which filled three great registers and awaited reduction. Meanwhile, he 
gives an abstract of the eases (87') cited above. In the discussion Broca was asked if hs thought the 
greater weight of the left frontal lobe was due to the part played by the thud convolution. He replied 
that the frontal lobe contained more than this convolution, but that the third convolution was situated 
at the level and behind the small cranial region, which is called the ptenon; he considered that the 
size of the pterion depends in part on the volume of the third convolution, and he asserted that the 
mean value of the left ptenon is a little larger than the right. Broca gave no figures, and the problem 
would be an interesting one if a satisfactory measure of the pterion could he derived. Pressed further as 
to whether he thought the influence of the third convolution oould affect the development of the frontal 
lobe, Broca replied that he was 14 tout dispose 4 eroire i oette influence ” (p. 586). BertiUon remarked that 
there were probably cerebral dextralists and sinistrahsts as there were manual swistralistsand dextralists. 

Hr Morant kindly measured for us the arc from the krotaphion to the sphenion on the left and right 
sides of 65 Egyptian male crania of the 26th to 80th Dynasties. This arc seems a reasonable measure 
of the sue of the pterion. The right arc was greater in 28 cases, the left in 85 and there was equality In 
8 eases. The mean value of the right are was 1X477**8741, and of the left are 18*195**8599. The dif¬ 
ference of the means, left minus right, was *518* *619, or foe difference was, on the number of skulls 
measured, insignificant. On these crania accordingly it was not possible to confirm Broca’s opinion that 
the pterionio area was greater on the left side. 

Biometrika xxz 
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clearly holds that excess of weight is a measure of excess of functioning, and believes 
that the brain is left-handed for certain functions and right-handed for others, 
a point of view adopted later by Riese as we shall note below. 

It will be seen that Broca did not base a pre-eminence of the left hemisphere 
on greater general weight. Topinard would apparently account for the discrepancy 
between Boyd and Thurnam on the ground of the latter having measured the brains 
of the insane; this in fact Broca was also doing in his 1875 paper, and it must be 
borne in mind when considering later Braune/s data. Meanwhile mention must be 
made of two English investigators, Bastian and Ogle. The characteristic of the first is 
cautious statement accompanied by hesitation to push undemonstrated theories to 
extremes; the tendency of the latter was to seize an hypothesis which, if true, would 
much simplify our ideas and press it forward before its foundations were well 
established. It is not our purpose to deal with the whole of the wide literature 
discussing the “ pre-eminence of the left hemisphere,” still less with that of the 
still more extensive topic of the localisation of function. But as some of the doubts 
expressed by Bastian are again coming to the fore, it may be well to recall them to 
the reader. Bastian wrote two years before Ogle’s paper*: ‘‘lam therefore strongly 
inclined still to believe in the similarity of function and practical equality of 
education of the two cerebral hemispheres, notwithstanding all that has been said 
of late in opposition to this doctrine,” and later on in the same paper he continues: 

In short, if anything like localisation of function is possible in the cerebral hemispheres, 
then J believe it would occur, and could be accounted for, rather in this way: that insomuch as 
wo have certain distinct avenues of knowledge (through the Sense Organs and their proximate 
nerve ganglia), and that the cerebral hemispheres are the parts concerned in the elaboration of 
impressions so derived t, wo can well understand that the impressions entering through one gate 
or sense-avenue, may pass through the substance and towards the periphery of these Cerebral 
hemispheres in certain definite directions, and according to accustomed routes. Then, the im¬ 
pressions entering through another gate of knowledge, or avenue of sense, may, and probably do, 
pursuo a different direction through its substance, so that at the periphery the fibres aud cells 
concerned in the condition and elaboration of these impressions may exist in maximum quantity 
in different portions of the surface of the hemispheres—though in i>art they may occupy jointly 
the same area, and be intertwined with the fibres and cells concerned in the elaboration of the 
previously mentioned set of impressions. And so on with the various sense organs and their 
ultimate expansions in the form of what I would call “Perceptive centres” in the cerebral hemi¬ 
spheres. Thus, though there may be much overlapping of areas, and though the area pertaining 
to the impressions of any particular sense in the cerebral hemispheres may be a very extended 
one (not to speak of the still further complication brought about by the communication established 
between the nerve cells of one sense area with those of others in the same hemisphere, and of the 
probable union by means of commissural fibres between analogous parts of the two hemispheres), 
still it may well he that certain portions of the surface of the cerebral hemispheres might corre¬ 
spond more especially to the maximum amount of nerve cells and fibres pertaining to some one 
or other of the various senses. I should expect, therefore, that the parts concerned in the pro¬ 
duction of the emotional feelings related to any particular sense or senses, as well as in the 

* Journal of Mental Science , January, 1869, “Note on the Localisation of Function in the‘Cerebral 
Hemispheres. 1 ” 

t “ Converting them in fact into what I call Perceptions—UBing this term in its ordinary psychological 
acceptation.” (Footnote by Bastian.) 
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production of volitional stimuli to which these might give rise, would be those parts of the Con¬ 
volutional grey matter that represented, as it were, the Perceptive Centres of the senses in 
question. 

In his lectures of six years later* Bastian takes a somewhat modified view, but 
is still unprepared to give absolute predominance to the left hemisphere. Here he 
admits that in the great majority of cases in which a right-sided paralysis ac¬ 
companied by aphasia presents itself it is produced by a lesion of the left hemi¬ 
sphere. “ In fact a long series of observations has compelled us to recognize the 
greatly superior activity of the left hemisphere, as compared with the right, in 
initiating motor acts subservient to intellectual expression. Just as the left hemi¬ 
sphere has undoubtedly to initiate the muscular acts by which writing is effected in 
right-handed individuals so it would appear that from this same half of the brain 
the incitations habitually pass over which are destined to excite the motor acts of 
speech—even though the muscles concerned are bilaterally disposed, and always act 
in concert on two sides of the larynx, fauces, tongue and lips ” (pp. 205—6). 

Bastian suggests that this initiatory action of the left hemisphere in relation to 
speech-movements may be connected with a slight precedence in its development, 
and this itself be a more or less remote consequence of an inherited tendency to 
right-handedness (p. 20(1). He considers that the situation in the left hemisphere 
affected in aphasic individuals may be (i) in or around the 3rd frontal convolution, 

(ii) in the white substance between this convolution and the left corpus striatum, or 

(iii) in the latter itself. On the whole, he prefers the third or Broca’s convolution f 
as the source of the volitional stimuli which incite motor acts of speech; it is a 
portion of the brain having intimate functional relations with many other parts of 
the hemisphere. 

Bastian then turns to certain difficulties attached to a theory of absolute 
dominance of one hemisphere; he recalls the fact that there seem to be de¬ 
finite cases where aphasia occurring with left-handed persons goes with left rather 
than right-sided hemiplegia. Further, he considers that while the left hemisphere 
is more especially concerned in the performance of voluntary motor acts of speech 
it would appear that there are also certain peculiarities of the function pertaining 
more especially to the right hemisphere. Lesions on that side are more frequently 
and rapidly fatal than those of the left and hemiplegic symptoms more severe and 
lasting. Lesions of the right hemisphere lead more frequently than those of the 
opposite side to disorders of nutrition (p. 210), and further, hysterical paralysis 
occurs more frequently in left than right limbsj. Another noteworthy point is that 

* H. Charlton Bastian: On Paralysis from Brain Ditease in its Common Forms, London, 1875. See 
especially Lecture IV* 

f The inferior frontal convolution on the left has been given Broca’s name. 

t Brown-S4quard recorded left side paralysed in 97 as against right side in only 24 instances. 
N. Saveliew (*• Gehirnembolie,” Virchows Arehiv , Bd. 185, p. 121) states that the left hemisphere is the 
chosen locality for embolisms. But this does not follow from his own data of 104 oaseB in which 29 were 
on right side, 89 bilateral, and 80 left sided. What we have to compare are 29 B. and 86 L. occurring 
in 65 oases; the mean for indifference would be 82*5, Mid the difference 8*5 does not seem significant 


8—2 
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lesions of the right hemisphere are more apt to give rise to paralysis or convulsions 
on the same side of limbs or face than in the case of lesions of the left hemisphere, 
and such cases Bastian holds cannot all be due to error of record though perhaps 
some are. “ The occurrence of paralysis or convulsion on the same side as the brain 
lesion is with our present state of knowledge quite inexplicable; still more mys¬ 
terious when we find one hemisphere apparently more apt than the other to 
produce such an anomaly 99 (p. 212). 

If Bastian’s views tend, as we think, to a differentiation of function rather than 
a dominance of the left hemisphere, there appears little reason ft priori to anti¬ 
cipate a greater weight or size in the left hemisphere. 

We now pass to Dr William Ogle, whose paper, entitled “On Dextral Pre¬ 
eminence/” was published in 1871 *. It is by no means clear whether Ogle in 
speaking of pre-eminence of the left hemisphere intends to attribute to it the 
same kind of dominance in mental functioning that the right hand usually has in 
manual, but his statements suggest this. 

He begins by speaking of dextrality as a characteristic peculiar to many forms 
of life. In man he found 57 men out of 1000 or 5‘7 %, and 28 women out of 1000 
or 2*8 °/ Q manual sinistralists, a conclusion pointing in the same direction, but not 
as sweeping as Hippocrates’ yvvrj ovSe/ila dfi<f>cS4^io^ Ogle says that out of 23 
monkeys in the Zoological Gardens, 20 were right and only 3 were left-handed, but 
he admits the experiment requires great caution. He states that the leg on which 
a parrot stands when it is given a nut is always the same, and that of 86 parrots, 
63 stood on the right and 23 on the left leg—a fair Mendelian quarter! He meets 
the objection that 63 out of the 86 parrots took the nut in their left claw and so 
were sinistralists by the statement that they must select a leg to stand on before 
they can take the nut. But the mental process of the parrot might be: “ Here is a 
nut, I must take it with my more efficient left claw, and so I will stand on my right 
leg.” Ogle following Dax asserts that the mental faculties concerned in speech are 
lodged in the left cerebral hemisphere (p. 292). In 1867 f Ogle had suggested that 
in left-handed men they would be lodged on the right. In 100 cases with palsy, 97 
had palsy on the right and were right-handed, 3 had palsy on the left and were left- 
handed. On the basis of this “ there can remain no fair doubt that right-handedness 
depends on some predominance of the left brain, and left-handedness, when it occurs, 
on a transposition of this structural peculiarity whatever it may be ” (p. 292). 

Ogle next cites Boyd's measurements on weights of the two hemispheres, and 
quotes Bastian's results as to the specific gravity of grey matter on the left side being 

on 65 cases. The data of Straass (22 cases) and Butin (88 oases) do not seem comparable with Saveliew’s, 
for they contain not a single instance of bilateral embolism wbioh occurs in 84 °/ 0 of Saveliew’s oases. 
For the same reason Meissner’s 82 cases showing only 10 °/ 0 of bilateral attaok are hardly comparable. 

♦ Medico-Chirurgical Transactions (Roy. Med . and Chir . SocX Vol. xxxv. 2nd Series, 1871, pp. 279— 
801. 

t St George's Hospital Reports t Vol. n. p. 122,1867. 



M. F. Hoadlkt 


101 


higher than on the right 0 in order to pass from a pre-eminence in functioning to 
a predominance in physical characters. There is no reason really, if there be a pre¬ 
eminence in functioning, which might solely connote a pre-eminence of commissures, 
why this should involve a preponderance in size or weight, or even in the complexity 
of convolutions, which, however, Ogle asserts that Dr Broadbent and he have both 
found for the frontal convolutions on the left side. The left hemisphere is not only, 
Ogle tells us, heavier, but more highly developed than its fellow, and this is the 
explanation of dextral pre-eminence. To prove his point Ogle obtained the brains 
of two left-handed women and submitted them to Dr Broadbent; the latter reported 
on them with drawings (not reproduced), and the conclusion is stated thus: “The 
ordinary conditions of the two hemispheres were in each of these brains reversed, 
the greater complexity of convolution occurring in both on the right side and not 
on the left” as—Ogle adds—“I had anticipated” (pp. 294—6). 

It will be seen that the ordinary condition, i.e. greater convolutional complexity 
on left, is here assumed to be proven. Broca himself—without providing actual 
statistics—had limited his assertion to the frontal lobe: “The convolutions are 
more numerous in the left frontal lobe than in the right, and the converse condition 
exists in the occipital lobes, where the right is richer in convolutions than the 
left.” 

It is clear that if we may suppose, as indeed we know, that man is bilaterally 
asymmetrical, then the odds are only three to one that a pair of left-handed women 
will exhibit pre-eminent hemispheres both on the right side. The data are far too 
slender to carry complete conviction. To meet the objection that the greater 
development of the left brain may be the consequence not the cause of the greater 
use of the right side, Ogle cites Gratiolet’s remarks, that the convolutions of the 


* Bastian’s results are given in a paper, “ On the Specific Gravity of different parts of the Human 
Brain,” Journal of Mental Seience t No. 66, January 1866, p. 29. They seem to have been averaged for 
27 brainB, and the results are as foUows: 

Averages of Grey Matter 

Convolutions Left Hemisphere Bight Hemisphere 
Frontal 1*0291 1*0276 

Parietal 1-0800 1-0296 

Occipital 1*0320 1-0816 

How far these results are of real significance it is not possible to say as no statement is given as to 
the variation due either to errors of measurement or to random sampling. 

For the sane BaBtian found for averages: 

Specific gravity of grey matter: Left side 1*0800, Right side 1-0296. 

For the insane, without regard to side, 1*0825. 

The specific gravity of white matter was always higher than that of grey matter, and without dis¬ 
tinction of side was 1*0404 for sane and 1*0405 for] insane. Our author did not find the heaviest or 
lightest brains with the highest or lowest speoifio gravities. 

Danilewsky (Medicinuches Centralblatt , 1880, No. 14, April 8), taking three brains only, considered 
the grey and white substances of the brain separately, also dealing with their specific gravity. He found 
the distribution of grey and white substances in the two hemispheres neatly alike (“nahesu gleich”). 
The differences between the speoifio gravities obtained by different observers are all of the order of the 
differences recorded by Bastian as existing for right and left hemispheres or for sane and insane. 
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left frontal lobe appear earlier in the foetus than the corresponding convolutions on 
the right. As Gratiolet has been frequently quoted as in some way confirming the 
pre-eminence of the left hemisphere, we searched his book* and could only discover 
a single very modest paragraph on this point, where he is discussing the develop¬ 
ment of the brain: 

II m’a sembld, par suite d’une s^rie d’observations consciencieusement <$tudi<5es, que les deux 
hdmisphbres ne so ddveloppaienfc pas d’une manibre absolument symdtrique. Ainsi lo ddveloppe- 
ment des plis frontaux paralt se faire plus vitc k gauche qu’& droito, tandis que l’inverse a lieu 
pour leH plis du lobe occipito-sphluoidal. Du moins, dans tous Ioh cas que j’ai observes, ai-je vu 
la scissure parallMe qui distingue le pli marginal inferiour se dessiner k droite avant de se montrer 
k gauche, (p. 241.) 

Here is no dogmatic assertion as to pre-eminence of the left hemisphere; in 
certain districts Gratiolet thinks he has observed the left, in others the right develop 
earlier ; he makes no statement that earliei' development is accompanied by greater 
complexity. Ecker, a first-class anatomist, dealing with the development of the 
furrows and convolutions of the brain in the human foetusf, states that he had 
found in foetal twins, in the one both frontal furrows present, in the other the first 
frontal furrow wholly absent on the left That the left side always precedes the 
right in the development of furrows and convolutions, “as Gratiolet has asserted,’* 
Ecker could in no way confirm. But Gratiolet really made no sweeping assertion 
of this kind. 

We now return to Ogle and cite the following words, in which we are respon¬ 
sible for those italicised: 

Seeing, however, that wo know, if the arguments I have used in the earlier part of this paper 
be valid, that some or other anatomical difference between the two sides must precede the right- 
handedness, and moreover that this difference must bo somewhere in the brain (for how other¬ 
wise can the facts I have brought forward concerning aphasia be explained?) it appears to me 
only rational to suppose , when one finds such an anatomical difference between the two hemi¬ 
spheres as that now revealed , that this anatomical difference is the antecedent for which one was 
searching, (pp. 295—6.) 

This paragraph begs two questions: (i) whether the difference of function must 
necessarily involve macroscopic differences, and (ii) whether the evidence of such 
differences existing has really been provided. 

Ogle now proceeds to settle what has given rise to this “ pre-eminence of the 
left side of the brain.’* He examined the cervical vessels and says that he found 
in 12 out of 17 cases of right-handed men the common or internal carotid was 
larger on the left than the right. He admits that the interpretation of the larger 
left carotid is rather dubious J, but tries to strengthen his result by saying that 
the left carotid is also less tortuous and so blood will flow more abundantly to the 

* F. Leuret et P. Gratiolet: Anatomie compare du Systlme , consider4 dans set Rapports avec VIn¬ 
telligence, T. ii (Gratiolet), pp. 241—2, 1857. 

t Archiv filr Antkropologie, Bd. m. S. 215, 1868. 

X If there be no essential differentiation, then the anticipated number would be 8*5, while the ob¬ 
served number is 12, a deviation of 8*5, whioh is only 2*5 times its probable error of 1*89. The result is 
therefore dubious statistically as well as anatomically. 



M. F. Hoadlby 


10a 

left hemisphere and to this he would attribute its [supposed] greater development*. 
Naturally persons with the viscera inverted should be left-handed. This is not 
universally the case; no statistics are, however, provided. Cases of inverted viscera 
and right-handedness Ogle attributes to education in dexfcrality. He cites G. St 
Hilaire’s statement that inversion of viscera is more common in men than women, 
and holds that this is in accordance with manual sinistrality being more common 
with men than women. Finally he says that carotids in parrots confirm his views 
(see our p. 100). 

Such is the memoir of Ogle, which created much interest in its day, and spread 
almost as a dogma the view that the left hemisphere is pre-eminent, without de¬ 
fining in which of many characters the pre-eminence is supposed to exist f! 

We are not able in this slight study to consider all the papers that have been 
published on this very obscure problem, but before we turn to quite recent work 
we may refer to a paper by Wilhelm Braune, entitled: “Das Gewichtsverhaltniss 
der rechten zur linken Himh&lfte beim Menschen” (Archiv fur Anatomie und 
Entwickelungsgeschickte , Jhg. 1891, S. 253—270). This is a good paper not only 
for its references and criticism of earlier works, but to a certain extent for its new 
data. Braune considers that up to 1891 no definite dominance had been proved for 
either hemisphere, and holds that the observed differences were within the error- 
limits. We do not feel clear that he distinguished between the errors of random 
sampling and of measurement; or between the error of the mean and that of an 
individual measurement. Neither he nor any of his predecessors makes the slightest 
investigation of what these error-limits may be. He considers that if the result of 
the “Nervenfaserkreuzung” produces for dextralists a greater development of the 
left hemisphere, then manual dextrality should always to some degree cause excess 
of the left hemisphere or at least some part of it, and there should only be excess 
of the right in cases of left-handedness. This he states is in nowise true (S. 259). 

Braune’s 100 weighings were made on hospital cadavers, partly by himself and 
partly by other anatomists, and he says they were made in uniform or standard 
manner, and included no insane or criminals. The brain as a whole was measured 
and also its parts. We are compelled regretfully to doubt his statement as to the 
absolute standardisation of methods of measurement, because in the cases of some 

* Moutier (cited by Bonvicini, Wiener medicinuehe Wochenachrift , No, 23,1920) argues in favour of 
asymmetry, but did not find the left side vascular system pre-eminent. 

t A good illustration of this occurs in a paper by Hasse, “Ueber Gesiohtsasymmetrie ” (Archiv fUr 
Anatomie , 1887, S. 124). Hasse is very properly defending the Greek sculptors against an anatomist's 
charge that they were not true to nature, because their creations are not symmetrical. No human 
being is symmetrical, he says. But he then goes on to attribute this asymmetry to the dominance of 
one hemisphere, and this the left one; this dominance is a sequence to its greater volume , which is again 
the result of greater muscular development of the right side due to a greater use of the right upper part of 
the body (!). In the same way Lombroso speaks of the left hemisphere in normal persons as being more 
pre-eminent than the right in both weight and complexity of convolutions. He then proceeds to state 
that criminals reverse this ordinary rule of normal persons, and says that among criminals 41 °/ 0 were 
asymmetrical on right, 20 °/ 0 on left and in 38°/ 0 the heads were equal, that is we suppose symmetrical. 
It is not easy to understand how if the head be asymmetrical it can be described as asymmetrical on one 
side rather than the other. 
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of his contributors the sum of the weights of the parts nearly approaches the weight 
of the whole, while in other cases that sum may differ by as much as 100 grs.* It 
looks therefore suspiciously as if some of the anatomists had weighed the brain as 
a whole before and others after separation into parts. Further as nine of the hun¬ 
dred cases were suicides, one died of chronic alcoholism, and two in the “Zuohthaus” 
(besides the failure to state that the remainder had never been in asylum or jail), 
we cannot convince ourselves that Braune’s statement as to the absence of the 
insane and of criminals is wholly correct, or that his material is thereby raised above 
that of Boyd, Thurnam, or Broca. 

Braune has not given the means of the several series of measurements, but these 
we have taken with the following results: 

Encephalon Cerebrum Cerebellum 

Right Half Left Half Right Half Left Half Right Half Left Half 

630*62 grs. * 029*99 grs. 551*23 grs. 649*66 grs. 78*93 grs. 79*76 grs. 

Or, the right encephalic hemisphere is 0*63 gr. in excess, the right cerebral hemi¬ 
sphere is 1*57 grs. in excess and the right cerebellar hemisphere 0*83 gr. in defect. 
Even should any of these differences prove to be significant t it is very hard to 
believe that they are sufficient to indicate a higher development or pre-eminence 
on either side of the brain. 

Braune gives the above results in the form: 

Encephalon: R. excess 47. Total excess 267*98 grs. Average 5*70 grs. 

L. „ 52. „ „ 213*2 grs. „ 4*10 grs. 

One case of equality. 

Cerebrum: R. excess 54. Total excess 273*4 grs. Average 5*06 gi*s. 

L. „ 37. „ „ 129*0 grs. „ 3*49 grs. 

One case of equality and eight not measured. 

Cerebellum: R. excess 33. Total excess 8575 grs. Average 2*60 grs. 

L. „ 54. „ „ 168*55 grs. „ *3*12 grs. 

Five cases of equality, and eight not measured. 

Braune emphasises this excess of the left cerebellar hemisphere, and says that 
the predominance in weight of the left cerebellar hemisphere corresponds with a 
53*8 °/ 0 of bulging which he found in 91 skulls of various races he examined. The 
case is more in favour of left bulging than he credits it with being, for he has 
counted those with no bulge as if they bulged to the right. He says that 12 cases 
occurred in which the right hemisphere weighed 10 grs., or more, in excess of the 
left, but none of those was associated with left-handedness. 

On the whole Braune’s paper is good, when we consider how little appreciation 
most anatomists have of the need of statistical reduction. Braune concludes by 

* Let the reader examine the relative equality of the sum of the parts and (he whole after Brain 
No. 86, and note how little equality there is before No. 86. Note especially No. 25. 

t We have determined the probable from the mean errors based on the excess differences of B. and 
L. values published by Braune. We reach the results 0*63 :fc *41, 1*57 ± *35, 0*83 ± *22. Thus in the 
oases of the cerebrum and the cerebellum significant differences can possibly be said to exist in weight 
between B. and L. sides, for these differences are 4*5 and 8*8 times their probable errors. 
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stating that if size or weight be causally associated with the unequal muscular 
division on the two sides of the body then this asymmetry ought constantly to 
follow the muscle and bone distribution, but this is not the foot. Clearly we are 
justified in saying that the work of Thuraam and Braune is at least as weighty ns 
that of Boyd and Ogle. \ 

Braunes data (if not his statements) confirm Thurnams results that die right 
hemisphere is slightly heavier than the left, and they support the view of Wilde, 
although less emphatically, that the left cerebellum is heavier than the right (see our 
p. 96 and p. 109). 

We will now pass to the more recent literature and consider to what extent it 
throws further light on the nature of this “pre-eminence of the left hemisphere. 91 
In the first place we have an inaugural dissertation by Eugen Riibel entitled: 
Ueber das Gewicht der rechten u. Unken Qrosshim-Hemisph&re im gesunden und 
kranken Zustand , Wttrzburg, 1908. Our author begins with an account of other 
investigators’ work and enlarges on the difficulty of the problem, the dividing of 
the brain, the drainage of the fluid and so forth. He considers they have come 
to opposed conclusions owing to imperfections and want of standardisation in their 
methods of separating the hemispheres. But he gives no evidence that other anato¬ 
mists have used less care than himself, nor does he explain in detail his own method 
of procedure. He says that, up to his own investigations, no author had in the 
problem of the relative weight of the hemispheres taken into consideration the 
skull capacity. He remarks: 

Wenndie Differenz in Prozentenzwischen Schadelkapazitat und Hirngewioht eine pathologische 
ist, so muss man nach derselben annehmen, dass das Gehim selbst in seiner Materie, durch eine 
akute oder chronische Gehimkrankheit verandert war. Eine aufallige Gewichtediflerenz der 
Hemispharen ware dann natiirlich in erater Linie ais Folge der Gehimkrankheit anzusehen. 
(S.8.) 

Riibel now proceeds to table his cases according to the percentage difference be¬ 
tween “Schadelkapazitat und Hirngewicht,” which reads as a difference between a 
volume and a weight! He neither explains how he reduces volume to weight, nor how 
he has determined the skull capacities of the subjects from whom the brains have been 
extracted. We have no statement or numbers supplied for the reduction, and as the 
specific gravities of the grey and white matters differ, and their relative proportions 
vary from brain to brain, and further the lateral ventricles differ in size, surely some 
detailed explanation is essential for the proper understanding of tables which speak 
merely of the percentage difference between skull capacity and weight. The author 
concludes that when there is no one-sided influence of disease both hemispheres are 
of equal weight. He thus appears to have reached the same point as Braune, but 
we have no idea of how he gets there and feel desperately inclined to echo the 
Schlagwort: “Weg mit Dissertationen!” 

The next writer we have to notice is far more suggestive. M. Inglessis proceeded 
to take “frontal sections" of hardened brains*. These sections were taken 8 to 

* “Untersuchtingen fiber Symmetric und Asymmetric der mensohliohen Grothirnhemiipharen. ’ ’ 
Zeittehrift filr die geeamte Neurologic und Ptgchiatrie , Bd. xcv. 8. 464—479. 1994. 
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10 mm. apart parallel to the plane through the auricular axis perpendicular to 
"Rieger's horizontal plane Photographs were then taken of the faces of these 
slabs and the areas right and left measured on the photographs by aid of a plani- 
meter. No attempt seems to have been made to reconstruct the volume of the 
hemispheres from the known thickness of the slabs and the areas. It may be re* 
marked that asymmetry in the positions of the auricular passages would produce 
artificial asymmetry of the sections. Taking 3 % difference as negligible our author 
proceeds to measure the percentage difference of his hemispheres; he does not as 
far as we can see state on what his percentage difference is measured. We take it 
that symmetry means in his case equality of areas, and that he terms that side 
“asymmetrical” which has the greater area. He divides the brain into a “forward” 
part, i.e. that in front of the standard plane through the auricular axis, and a 
rearward part, namely that behind this plane. On the basis of measurements for 
200 brains Inglessis found: 

Forward part symmetrical, rearward part symmetrical, 13 : 6°/ 0 . 

„ „ „ „ „ asymmetrical, 168 : 84 °/o* 

„ „ asymmetrical, „ „ 'symmetrical, 0 : 0 %. 

„ ,, „ „ „ asymmetrical, 19 : 5 °/ 0 , 

Thus of the forward parts 181 were symmetrical and only 19 asymmetrical, but of 
the rearward parts 187 were asymmetrical and only 13 symmetrical. The 19 brains 
with asymmetrical forward parts were chiefly those of persons with organic brain 
disease, in particular, paralysis. Age and sex seemed to have little influence on the 
asymmetry, and the influence of disease was not definitely significant for this 
number of brains. 

It will be seen that if we can trust Inglessis’ results the asymmetry was practically 
confined to the occipital lobes, and we must answer the question put to Broca as 
to whether the enlarged frontal portion of the left hemisphere was due to the organ 
of speech being on the left side, by saying that the frontal portion of the left 
hemisphere is not greater than that of the right, and thus it is idle to question 
whether it is due to the pre-eminence of the left third frontal convolution behind 
the pterion. Inglessis thus directly contradicts Broca’s view. Turning now to right 
and left asymmetries, i.e. to which is larger, Inglessis finds that of the 19 forward 
asymmetries 11 were left and 8 right, i.e. the difference is not significant, but of 
the 187 cases of rearward asymmetries 161 were left and only 16 right. Thus 
according to our author the brain is asymmetrical, but this asymmetry is confined 
chiefly to the occipital part of the hemispheres and here the left hemisphere pre¬ 
dominates. If we combine the conclusions of Braune and Inglessis we should accept 
a predominance of the left side of the encephalon in the occipital portion of the 
cerebrum and in the cerebellum. But if this be true, is it needful to associate it 
with differentiation of psychical function in the two hemispheres? May it not be 
that a majority of persons having their hearts on the left side find it easier to sleep 
for the major portion of the night on the right side ? While the brain is growing this 

* For a definition of this plane the author refers to an inaugural dissertation of Gertrud Wolf: 
Ueber die Lage der Ohrachee , in JBeziehnng zum Schddel und Qchirtu Wlira burg, 1918. 
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must give greater pressure on the occipital than on the frontal portion of the right 
hemisphere, while the left hemisphere would be free of such pressure. Hence 
might arise the asymmetry observed by Braune and Inglessis. We do not press the 
point, but it seems as worthy of consideration as any theory of left pre-eminence. 
It is also consistent with the view of H. Reichardt*, who holds that there i&no 
sensible difference between the weights of right and left hemispheres even when 
the skull is asymmetrical. 

We now turn to a second memoir by Inglessis entitled: "Ueber Kapazit&ts- 
unterschiede der linken und rechten Halfte am Schadel bei Menschen (insbesondere 
Geisteskranken) und iiber Hirnasymmetrien fThe object of this paper is to show 
that there is an extensive agreement between the skull and the brain sides which 
predominate (“iiberwiegen"). Inglessis does not appear to have had the whole skulls 
from which the brains were extracted, but only the skull-caps (“Kalotten”) which 
were removed for the purpose of extracting the brains. These caps were taken off by 
saw-cut; and he says that they were sawn off at the plane going through the upper 
borders of the orbits and the uppermost points of the auricular passages. Now our 
own experience shows us that the upper borders of the orbits and the two auricular 
points in the vast majority of crania do not lie in one plane, and to take off a skull¬ 
cap with any exactitude in this way would be a task of the greatest difficulty 
demanding an accuracy hardly attainable in the post-mortem room. Inglessis 
considers the skull-cap thus obtained gives the most important portion of the 
cerebrum. He then divides the skull-cap into the right and left halves by a sheet 
of lead. This is taken through the crista galli, in the concavity through the sulcus 
longitudinalis superior, and posteriorly through the “ middle ” of the protuberantia 
occipitalis. He does not say how he cut the lead sheet to fit the individual skull¬ 
cap, or how, having done so, the right and left capacities were determined by 
water up to a border which could not be uniplanar. The whole process seems an 
extraordinarily difficult and necessarily rough one, but Inglessis concludes that the 
capacities thus determined agreed with the frontal section method in showing the pre¬ 
dominance of the left hemisphere; thus the predominance of the left hemisphere in 
the case of the skull-cap must have been in the capacity of the occipital region. Our 
author attributes in chief although not entirely the difference in capacities to the 
deviations of the falx cerebri to right or left J. 


Deviation of Falx to right Falx straight 

69-4 % 19-8% 

Asymmetry to left Symmetrical 

63-5% 6*3% 

Predominance of left in skull-cap Equality 
63*5% 5*2% 


Deviation of Falx to left 
20 * 8 % 

Asymmetry to right 
30*2% 

Predominance of right in skull-cap 

31*3% 


Lateral Ventricle greater on right Ventricles equal Lateral Ventricle greater on left 
66*8% 17*8% 15*4% 

* Arbeit am der ptychiatr . Klinik zu Wiirzburg, Heft L S. 44, und Heft vi. S. 341, 611. 
t Zeitechrift fiir die geeamte Neurologie und Peyehiatrie , Bd. lent 8. 854—378, 1925. 

As determined by thesulous longitudinalis superior deviating to right or left, but this does not appear 
to measure the full possibility of deviation in the falx cerebri. 
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The measurements were made on 96 skull-caps only. Our author terms this an 
"ausgedehnte Uebereinstimmung ira Ueberwiegen der linken Seite.” In women 
Inglessis found that the predominance of the left hemisphere was less; this should 
be correlated with more left-handedness in women than men, which does not appear 
to be the case. 

If we might trust Inglessis* method and results, they would give us confidence 
in the belief that internal measurements on the skull will more or less accurately 
describe what is the position with regard to the brain. 

A paper by Rasdolsky, entitled “The Asymmetry of the Hemispheres of the Brain 
in Man and the Animals,” appeared in 1925*. The author starts with the statement 
that the functions of speech, writing, reading, customary and expressive actions, the 
visual and auditory gnosis are only perfectly developed in man in one predominant 
hemisphere; and that except in man the two hemispheres are quite symmetrically 
organised. Presumably therefore manual laterality does not occur in apes, monkeys 
or parrots (see our p. 100). He attributes right-handedness to the position of the 
heart and the effect that the expenditure of much energy on the left side would have 
on the heart (described as the theory of Astuazaturoff and Weber). There are no 
experimental data given and the paper is a somewhat dogmatic attempt to account 
on a developmental basis for the pre-eminence of the left hemisphere which 
Rasdolsky considers proven. If apes and monkeys, having a left-sided situation of 
the heart, be considered as animals, is it correct to assert that their hemispheres 
are symmetrical and they have no manual laterality? 

In 1926 we have a paper by C. U. A. Kappers on “The relative Weight of the 
Brain Cortex in Human Races and in some Animals, and the Asymmetry of the 
Hemispheres f.” Here we are concerned with the weights of the grey matter in 
right and left hemispheres taken in relation to the total weight of the hemisphere 
and of the brain. Kappers* procedure is dearly a long and laborious one, and it 
could not be easily applied to determine in adequate numbers either the mean of 
the difference of these relative weights or the probable error of this difference. He 
applies it only to three Dutch and three Chinese brains which are far too few to 
really determine (i) average differences, (ii) any excess or defect in right or left 
hemisphere, or (iii) any differentiation of European and Chinese brains. 

J. Wilde in the same year, in a paper entitled “Ueber das Gewichtsverhaltniss 
der Hirnh&lften beim Menschen J,” returns to the old problem of weighing the two 
sides of the brain. He dealt with 200 brains of individuals from 17 to 77 years 
of age of which 125 were male and 75 female, and says that his greatest error as 
shown by control measurings was 0*25 gr. The brains were those of Letts, Germans 
and Russians. We have only seen Wallenberg*s account of the paper in the 
Zentralblatt /. d. gesammte Neurologie u. Psychiatrie (Bd. LV. S. 393—4), but the 
divisions and measurements seem to have been carefully made. The total weight 

# The Journal of Nervous and Mental Disease, Vol, lxh. pp. 119—182, N.Y. 1925. 

f Ibid . Vol. lxiv. pp. 118—124, N.Y. 1928. 

$ Paper from the Neurologic Institute of Biga. Latvijas Univ . raksti, Vol. xiv. pp. 271—288,1926. 
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of the right aide was somewhat the heavier, and this applied also to the cerebrum* 
On the other hand the (cerebellum + pons Yarolii + medulla oblongata) was heavier 
on the left side than the right. Thus this last brain weight paper appears to con* 
firm the results of Thurnam, Broca and Braune, who found the right hemisphere 
somewhat heavier (see our pp. 96,97,104), and of Braune again who found the cere¬ 
bellum heavier on the left (see our p, 104), and to confute those of Boyd (see our 
p. 95), who found the left-hand hemisphere heavier. Our author concludes that: 

“The weight of the brain is accordingly no witness to the functional superiority 
or inferiority of the brain (exception being made of that of the microcephalic idiot 
which falls below the normal limits).** 

Walther Biese in the following year (1927) published a paper* on: “Die tTber- 
wertigkeit der einen Hemisphere auf Grund hirnmorphologisoher und hirnpatho- 
logischer Untersuchungen.” He criticises strongly the view that superiority belongs 
to one hemisphere or to the other. He asserts that we must investigate in each 
locality which hemisphere has the greatest area of functioning substance, furrows 
and convolutions both being examined. It will be found in this way that one 
hemisphere may dominate for one tract and the other for another. Biese considers 
that all grades of superiority may occur and these may vary from one tract to a 
second. A manual dextralist may inherit only the laterality of one tract, and this 
give rise to a very complicated laterality in the brain. Our author cites with ap¬ 
proval Kleist, who considered that for any brain tract there might be sinistrality, 
equality or dextrality. Thus a genuine manual dextralist might on his supposed 
inferior right hemisphere actually have tracts of superior brain dextrality (S. 227—8). 
According to Biese (who does not cite Ogle, but appeals to a single case examined 
by Flechsig) there may be a superiority of blood vessels on the left side, but it is 
not yet proven f. 

Direct investigations of whether the left hemisphere is wholly directive have 
recently been made. We may refer first to B. A. Pfeifer, who has considered the matter 
in his “Bemerkungen zur Links- und BechtshandigkeitJ.” He starts by saying that if 
we accept the “ Aproxielehre” of Liepmann we are forced to the conclusion that the 
superiority of the left hemisphere almost entirely excludes any participation of the 
right in action. But observations on left and right-handedness teach us that both 
hemispheres influence each other in alternating manner in action; this Pfeifer 
says is very obvious in the use of bilateral innervatoiy identical musculature, as 
when the two hands make the same or opposed movements. In the use of bilateral 
innervatory identical musculature the anatomical certainty of the course of the 
motion is most remarkable, and we attribute it to the linkage of homologous 
districts in the brain by the nerve fibres of the corpus callosum (“Balkenfasem”) and 
thus the transit of innervatory impulses from one side to the identical innervatory 

* MonaU$chr\ft far Psychiatric und Neurologic, Bd. lxiv. S. 195—228. 

t Biese’s main material consisted of three brains of manual dextrailsta with disturbance of speech 
and right-handed lesions. In these cases he was able from convolutions and snlei to show morphological 
right aide superiority. 

X MUnehener mdidnuehe Wochcnechrift, Bd. lxxit. L B. 846,1987. 
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musculature of the other. Thus in writing with the right hand arises by way of 
latent co-practice the mirror writing of the left hand; bad writing of the left hand 
flows from this acquired tendency to mirror writing. From these and other instances 
and illustrations, Pfeifer draws the conclusion that the so-called dominance of the 
left is not independent of the corresponding tract on the right hemisphere. 

Some attempt to measure this reciprocal influence of the cerebral hemispheres 
has been recently made by J. Wysocki* and by Wysocki in conjunction with L. 
Zbyszewski *. Their process was associated with the phenomena termed by Brown- 
S6quard “dynainog4nie.” 

L’entre-croisement de doux excitations dans un groupo de cellules nervouses, a Pintersection 
de deux neurones, no determine pas toujours des actions d’arrOt, rnais peut donnor lieu, au contraire, 
k un renforcement de l’excitation. (he. cit. p. 1009.) 

The same class of phenomena has been termed by Exner “ Bahnung.” Choosing 
homologous motor tracts of the surface of the two hemispheres of the brain, our 
authorsexcited them in diverse ways, and found that these excitations had areciprocal 
influence. It would thus seem possible by exciting one hemisphere in a definite 
region to strengthen the effects which flow from exciting the other hemisphere in 
the same region, or there is some evidence to indicate that the two hemispheres to 
a greater or less extent cooperate. 

(ii) Our own Material . It will be seen from the previous analysis of papers 
dealing with the pre-eminence of one or other hemisphere that the earlier and 
somewhat dogmatic views have been largely called into question, and that the pre¬ 
eminence claimed for the left hemisphere has not been shown hitherto definitely to 
be associated with corresponding pre-eminence in such gross characters as weight 
or size. The measurements referred to in the first section of this paper permit a 
definite answer to be given to the relative pre-eminence of the hemispheres in one 
measurement of size, namely their length. The maximum lengths of the right and 
left sides of the cranial cavity were taken in 729 male Egyptian skulls. These lengths 
were taken on either side of the crista galli to points above the occipital protuberance, 
the instrument being held parallel to the median sagittal plane, and endeavours 
made to pass from the most anterior point of the frontal to the most posterior point 
of the occipital bone. Three trials were made to obtain the maximum on each side, 
and the greatest of these was selected as the maximum for each side. If these lengths 
be L r and L Lt then \ (I r + L l ) was the quantity used for correlation purposes in 
connection with the capacity of the skull. This quantity £ (L R + L l ) was entered on 
the card of the skull, and no record preserved of which component length was L R 
and which L l . The same system of measurements was again repeated for the whole 
729 crania, record being kept of L R and L l separately. The maximum values right 
and left were read to the nearest half millimetre and Fig. 1 provides a complete 
statement of all the corresponding values and a picture of the results. 

* Comptes rendu* de la Societt de Biologic , Tom. xcvi. pp. 572—575, March, 1987, and Tom. xen, 
p. 1009 (1925), Tom. xom. p. 1629 (1925). 
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We will investigate first bow we can determine the error due to the measurer. 
Let s x be the mean of the two lengths in the first measurement and be the corre¬ 
sponding quantity for the second measurement. Let l R mid L l be the truq lengths 
of the hemispheres, and e u e a . e/, e» the errors made in the measurements taken of 
the four lengths, then M 

Sjar + + + s*** £(£*4- &x) + h € i + !*•» 

and accordingly' 

and if <r denotes a standard deviation 

<*\-l t m H ^ + o 4 *,), 

approximately, for there is no reason to suppose high correlation of errors. 

Now cr\ « a\ and <r\ 8 « for they represent the variation in measuring the 
same quantity at a short interval. Further we have no strong reason to assert that 
any sensible difference was made in the manner of measuring left and right hemi¬ 
sphere lengths. Thus it seems fair to take a* H *= a\ t and accordingly we have 
Vzr-zt = where e stands for e lf e a , or e a '. In the next place the frequency 
distribution was formed of z x — * a , i.e. the difference of \(L R + L£) at the first and 
second measurements, and the mean and standard deviation obtained of this dif¬ 
ference. They are in mm.: 

Mean (1st measurement — 2nd measurement) *= — *1104, 

Standard Deviation *r gl - H «1*7856. 

Hence: Mean/probable error of mean — *1104/0446 *■ 2*47, 

and accordingly as the ratio is less than 2*5, it is not at all improbable that the 
difference in the means of \{L R + L l ) at first and second measurements may be due 
to errors of measurement. Now let us consider the means of L R and L h \'the probable 
error of measurement of both is *0446, and accordingly the probable error of their 
difference, if due to errors of measurement, is *0631, but the means of L R and L l 
differ by *9945 mm. or by 15*75 times the probable error of the difference. Thus al¬ 
though the probable error of a single measurement** *67449 x 1*7856»1*2043 mm. is 
considerable—as it must be when we remember the difficulties of taking an internal 
measurement by way of the foramen magnum—yet the process is sufficiently ac¬ 
curate when we take the means on 729 crania to prove that a mean length of rigkl 
hemisphere of 171*0446 is significantly greater than a mean length of left hemi¬ 
sphere of 170*0501 mm. In other words, if we may take our sample of over 700 
male Egyptian skulls as representative, the right hemisphere is in the main of 
greater length, and therefore probably of greater capacity than the left. 

The actual constants obtained from the measurements are: 

Mean L R **L R ** 171*0446 mm., Standard Deviation ® 5*9551 mm. 

Mean L h ** Tj,** 170*0501 mm., Standard Deviation ** 5*6908 mm. 
Correlation coefficient *■ *9603*. 

♦ From the grouped Table tf on p. 112. Worked from the formal* 

, % l u * (**!* + tfi* ***!*)» 

wekav^f^4 A «a*960S. ‘ 



TABLE IV. 

Internal Lengths of Right and Left Hemispheres (Condensed Table). 
Internal Length. Left Hemisphere in mm. 
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If we work from the formula: 

Correlation coefficient •» ~— — : —— j2 —* ——* , 

2 Wz, <r R <r L [ N J 

where 2 is a summation for all values of L R - L R greater than L l — X Lt we find 

Correlation coefficient — *9614. , 

These values are, of course, uncorrected for errors of measurement, and the last 
two obtained by very different methods are in good accordance. 

Since L r **L r 4-and we may suppose e* independent of the length measured, 
we have: 

<7*lm * o^Zr + <**•!’ 

and this leads to 

<7^ = 5*681053, and c th « 5*403434, 

for the true variabilities of L R and Z L . Thus the true variabilities of L R and L l are 
3*18 and 3*03 times the variability due to error of measurement, i.e. 1*785568. 

We are not able, however, to correct t LrLl for the errors of measurement; for if 

we multiply the crude value *9603 by Cl ~ - L l we find the corrected value somewhat 

exceeds unity or our <r, is too large . This we believe arises from the hypotheses 
(a) that measuremental errors in determining L R and L h are uncorrelated, for corre¬ 
lation may arise from the approximate symmetry of the skull which creates the 
same type of difficulty on either side, or again from the nature of the individual 
foramen magnum; and ( b ) the same conditions may influence the errors e lf e x ' (or 
e t , e 2 ') when measuring the same length twice. Such correlations will influence not 
only the size of <r< and so the sizes of <ri H and <ri Li but also the value of the product- 
moment S(L r L l ). Actually 

jy- $ (L r — Lii) ( L l — L l ) — er «* r tJ 

but as our data are inadequate to find r n<l (= probably r fl ' f# '), and we do not know 
r n«i' (" probably or (« probably r,^) we cannot determine <r. a . Our results, 

however, do indicate that these correlations differ from zero, even if the associations 
be only slight. 

We have seen that the errors of measurement are unlikely to account for the 
difference between mean L R and mean L l% we may now consider whether their 
difference *9945 could easily arise from random sampling*. As we do not know 

* !»*=£*+§!, hence: mean I^smean f^+mean ci« In taking the mean L M - mean mean 

L x - mean l L we merely suppose mean e^mean e a and not neoessarily mean e l =sero. It might 
he supposed that mean oould not be aero as we are a imin g at i^-true maximum length, but it must 
be remembered that this length is to be measured parallel to the median plane, and if there be error in 
this adjustment we may exoeed the true maximum, further there may be errors in excess in reading the 
instrument or in resetting it after extraction, so that we oannot by any means assert that dll «’s are 
positive. 

Biometrika xxx 


9 



114 On Measurement of the Internal Diameters of the SkuU 

the true value of the correlation of L R and L l we are compelled to use that of L & 
and L Ijt ie. *9603, which cannot differ much from it. We have then 

°*Lh-l * ZvIrVLl x ‘9603 

- 2*514,4343, 

or, via-h = 1 *585,697. 

Accordingly the probable error of Ir — L L) or *9945, is *0396; thus the difference is 
over 25 times its probable error. 

Lastly if we consider the observed difference of mean L R — mean L Lt its observed 
standard deviation is 1*664,3155 and therefore the probable error of the difference 
is *0416, or the difference is 23*9 times the probable error, this last proceeding 
neglecting the distinction between variation due to random sampling and that due 
to measuremental errors. Whichever way we approach the problem we find the 
significance of the difference between the lengths of right and left hemispheres 
emphasised. 

We may attempt to compare our results for greater right hemispherical length 
with those of Braune and others for the relative weights. Thus let us consider 
two skulls both of which are symmetrical , but one of which has the mean internal 
length 171*0446 and the other 170*0501—i.e. the mean right and left hemisphere 
lengths—further, we will suppose the internal height and breadth to have their 
mean values, the capacities of these would be, by the formula on p. 92, 1435*39 
and 1442*80 cubic centimetres*. Now Braune gives 1260*61 grs. for the mean 
weight of the German encephalon; assuming his material was chiefly male, tho 
cranial capacity of Germans would be about 1500 cm. 8 or 1 cm. 3 of capacity corre¬ 
sponds to *8406 gr. of brain weight. Our two symmetrical brains would therefore 
correspond to weights of 1206*59 and 1212*82 grs. respectively for the entire brain. 
But Braune found the ratio of weight of encephalon to cerebrum to be 1260*61 to 
1100*89, or the factor is *87329, accordingly the weights of the cerebrum in these 
two symmetrical brains would be 1053*70 and 1059*14 or their hemispheres 
526*85 and 529*57. If these two hemispheres be put together to form a single 
asymmetrical brain with the larger weight on the right side, they would correspond 
to a mean skull with the difference in length of right hemisphere over the left equal 
to *9945 mm. This would give a preponderance in weight of right hemisphere of 
brain = 2*72 grs. Braune found a difference of 1*57 grs., which we have shown to 
be significant, and Broca of 1 *97 grs. The difference between our result and Praune's 
is about three times the probable error of the difference. Such a difference might 
very well be anticipated for we are dealing with two very different races at very 
* The reader must bear in mind that the mean value P { of the internal diametral product is not the 
product of the three mean internal diameters, i.e. whioh is 2901*94 cm. 8 corresponding to a 

oapacity of 1488*80 cm. 8 , and not 1440*80 cm. 3 , the mean value. The true mean P i to a first approxima¬ 
tion is _ _ 

Pt—i'i Hi (1 + r n i r/ t + v Bi v Li r Ui ^ * t ), 

where the i>’s are coefficients of variation. This giveB 2906*08 cm. 3 — a reasonable approach to the 
true valne 2906*58 cm.*. The slight difference 0*5 cm. 8 is due chiefly to the efifeot of grouping in deter¬ 
mining the means of Pi> L { , B { and H it and not to the need for a further term in the approximate 
value. 
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different periods of history, and we have further neglected all differences in size of 
the hemispheres except their length. It may well be that there exists some com¬ 
pensatory factor at work, so that a long cerebral hemisphere may be less high and 
broad. Further our method of approximation is very rough, it being improbable 
that the weights of encephalon and cerebrum are really proportional to their cubic 
capacities. Still the fact remains that the best weighings and the moat complete 
series of measurements yet made lead to the same conclusion, a pre-eminence of the 
right, not the left side of the cerebrum. If it were not that so many anatomists 
have sought to associate a psychical pre-eminence of the left with size or weight, 
we should have been inclined to say ab initio that such pre-eminence, if it exists, 
would have little relation to such crude physical characters as these. Indeed 
we should doubt whether it would be found markedly exhibited in anything short 
of microscopic differences in corresponding tracts of the grey substance of the two 
hemispheres. 

Before we conclude this matter we may draw some further inferences from 
our Fig. 1, on p. 110 above, where the readings are taken to the nearest half 
millimetre. The dotted diagonal line marks the cases in which to within the unit 
of measurement the two hemispheres were of equal length. There are 81 cases or 
1 I’ll %' Adding up the cases in lines parallel to the diagonal line we get the 
accompanying frequency distribution for L b -L l , the length of right hemisphere 
minus length of left. This shows us that in 68*59 % of cases the right hemisphere 
was larger than the left, and that it was only less than the left in 20*30 % cases. 
If we divide up the 11*11 %, in which to our unit of measurement the lengths are 
equal, in the ratio of the excess of left over right to excess of right over left, we find 
on adding these 2*54 % and 8*57 % to the 20*30 °/ 0 and 68*59 % respectively, 22*84% 
of cases of the left hemisphere in ^excess and 77*16 % cases of the right. As the 
process is a rough one these percentages are close enough to 25 % and 75 % for a 
very ardent Mendelian to suggest that lesser size of left hemisphere is inherited as 
a Mendelian recessive, and corresponds with the asserted 25 % of left-handedness. 
But if size has anything to do with dominance of control this relationship will 
scarcely work, as it is the left hemisphere which is supposed to control the right- 
handed person’s movements. 
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the distance from origin of curve to mean being -18*503496, and from mean to mode -4* *477868. 
The unit of x is 0*5 mm. The standard deviation—3*328631 —1*6643155 in mm. 
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To test the percentages still further the frequency was fitted with a Pearson 
Type IV curve (see Fig. 2) and the areas on either side of zero read off with a plani- 
meter; the percentages then obtained were: 

Length of K. Hemisphere > Length of L. Hemisphere 75*48 %, 
»»*♦»> ^ >» »» » 24*52 f 0 * 

In other words the number of cases in which the left hemisphere is shorter than 
the right is exceedingly close to the Mendelian quarter. 

We may judge from these results that the right hemisphere of the cerebrum, so 
far from being inferior in size or weight to the left hemisphere, is slightly superior 
to it, and the proportion of cases in which it is in excess of the left corresponds closely 
to the number in which the right side of the skeleton exceeds the left, or, as some 
have asserted, the proportion of right-handed to left-handed individuals, which has 
been claimed to be a Mendelian three-quarters. We do not consider that this is in 
any way opposed to the view that—at any rate for some functions—the left 
hemisphere may predominate, because we do not believe that a volitional pre¬ 
dominance is of necessity associated with such gross characters as size or weight. 
Consequently we do not think that the present result must be taken in any way to 
strengthen the assertion recently made on the basis of monumental art that the 
ancient Egyptians were a left-handed race*. 

There is a slight—a very slight—predominance of gross physical characters, 
possibly in the ratio of 75 to 25 for the right side of the body, and this extends to 
the right side of the brain notwithstanding the decussation of nerves, which it has 
been held must connote a gross physical pre-eminence of the left hemisphere 
in manual dextralists. 

Appendix of Tables. 

I. Capacity and Internal Breadth. 


II. 

»» 

„ External „ 

III. 

w 

„ Internal Height. 

IV. 

»» 

„ External „ 

V. 

H 

„ Internal Length. 

VI. 

ft 

„ External „ 

VII. 

Length Thickness and Breadth Thickness. 

VIIL 

» 

„ „ Height 

IX. 

Breadth 

)» » » V 

X. 

External Length and External Breadth. 

XL 


Breadth „ „ Height. 

XII. 

» 

Length „ „ 


fr^WL’ 


See Man , Yol. xxm p. 137, but also Vol. nix, p. 24. 
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XIII. Internal Length and Internal Breadth. 

XIV. „ Breadth „ „ Height. 

XV. „ Length „ „ „ 

XVI. External Length and Internal Length. 

XVII. „ „ „ „ Breadth. 

XVIII. „ .. Height. 

XIX. Internal Breadth and External Height. 

XX. „ „ „ Breadth Thickness. 

XXL External Diametral Product and Capacity. 

XXII. Internal ,, „ „ „ 



APPENDIX OF TABLES 


M. F. Hoadley 


0 


to 1ft lO iO o o 


I I I tl H 



I i I I I f I I I I I I I I 


I I I I I *> 


I ! I I I I I I I * II U 


Mill— I— 1 ‘PIIII 


I I I I * I I | | I I | I I 


*pg«-p | | | | | | 


•p tp 

-L Wt-05 


1 

1 

1 1 

1 

1 

I 

CM 

p 

8-5 

5-75 

iO 

CM 

b 

iO 

b 

»p 

1 

ip 

I 

1 

1 

i 

1 

1 

1 1 

j 

1 

I 

1 

»p 

p£ 

b »b 

iO 

b 

p 

CM 

iO 

CM 

ip 

1 

I 

1 

l 

1 


1 

1 1 

1 

1 

p 

CO 


—i 

»p 

l.'- 


CO 

1 

1 

i 

1 

I 

i 

1 

1 

1 1 

1 

1 

p 

CO 

•o 

t-» O ^ 

<o 

b 

co 

- 

1 

1 t 

i 

i 

1 

i 







*a> 

ITS 

ITS lO 










1 

1 

1 1 

I 

1 


I- 

CM 

CM J>- 




1 

j 

1 

| 

| 

i 

1 

1 

1 1 

1 

1 


b 

&> 

b b 


CO 

1 —i 

1 


1 

1 

1 

i 








r— 

r— r-1 
















•a 

iO 

»0 iO 










1 

1 

1 1 

1 


*p 

CM 

CM 

CM 

•p 

p 

\ 

1 

I 

| 

I 

| 

i 

1 

1 

1 1 

1 


CO 

OS 

s 

b CM 

b 


I 

1 

1 

I 

t 

1 

i 




_ 





i—i r—« 















o 


*p 

*P p 

p 

p 








1 

1 

1 1 

1 

1 

b 

CO 

P-H 

-rf 4 -i 

b 

r—* 

co 

1 

1 1 


1 

1 

1 







r **’ 1 

CM 

r—1 i—1 


















iO urs 










1 

1 

j 1 

iO 

p 

p 

p 


A r* t r 

p 

p 

1 

1 

1 | 

1 

1 

| 

i 

1 

I 

1 1 



b 


8 

CM do 

*“* 


1 

1 

1 1 

1 

I 

! 

i 









ip *0 










1 

1 

1 1 





ip 

CM b- 


1 

1 

1 

1 | 

I 

1 

1 


1 

1 

1 1 

w 

co 

b 

l-H 

O 

b co 

pH 

1 

1 

1 

1 1 

1 

1 

1 























p 

iO 




»p 










1 

1 

1 1 


b 

CM 


t— 

t- r^-t 

I 

1 

j 

1 

1 I 

1 

I 

i 

1 







r—l 

r—< 















10 

lO 













1 

1 

p 



CM 

p 

p 

Jp lO 

| 

| 

1 

1 

| | 

| 

| 

1 




— 

CM 

CO 

iz. 

OS 


CM 














iO 

*G> 













1 

1 

o 

»o 

(N 




lO 

| 

| 

1 

1 

| 1 

| 

[ 

1 




(N 

CM 

<r> 

CO 

r- 

ICO 

|H 










1 

t 

1 




p 


»p 

1 

| 

1 

1 

1 I 

| 

1 

i 

i 

1 

1 

1 1—1 

r—i 

OO 

OS 

ib 



1 

! 

1 

J 

I t 

1 

1 

i 

1 



_l I II I I II I I I 1 I 


I I I I I I I I I I M I II I 


H | | | | II | | | | | I 11 I I I | 




IBJSU&O) «’ 1110 Ap*d«9Q 


Totals! 2 1*5 3*5 5*5 6 13 16*5 26 28*5 34*5 63*5 





















































120 On Measurement of the Internal Diameters of the Skull 


9 

PQ 

IS 


8 

3 

2 

■+3 

a 

<D 

o 

s 

s 

.a 


*08 

e 

■g 

w 


Totals 

■o ns « •© ift <o >a to 

~ ! till- 

V* T*4 

729 

191 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1- 1 1 1 1 1 

- 

... 


: 


1 I 1 II l 1 1 II 1 - 1 I 1 M 1 1 1 1 


rm 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

i 

89T 

■HlHWBIIBIilim 

lO 

T91 

1 1 1 1 1 1 1 1 1 1 1 1 1 * 1 1 1 1 It 1 

»© 

091 

1 1 1 1 1 1 1 1 1 1 Mill!-, 

a> 

6tT 

1 1 1 1 i M II s 1 £« M 1 1 1 II 1 

CD 

&ri 

I l 1 l l l i l L II 1 l 1 

o 

pH 

Itl 

I 1 1 1 1 1 1 1 tO t© >0 1 1(||(|| 

to 

CD 

9tT 

11 1 1 1 1 1 1 rM iO ift C« ' 1 1 * 1 11 1 

t© 

OO 

»—i 

9*Tl 

1 1 ! 1 t 1 1 1 1 1 1 1 1 1 

1 « » • 1 1 1 HN«00®N»0 1 I 1 1 1 1 * 

s 

tfl 

k _ »o 

• i | i i i i CM CM illl l 1 

1 | | 1 1 | | • 1 1 1 1 1 1 


VtT 

1 1 I 1 I 1 o o II 1 1 1 1 

1 1 M (N lO r-r-»f (M W 

CO 

CO 

rfri 

111) | 1 1 1 1 1 I 1 

1 1 1 1 —• |CKICOj--J»O*OC3DC0r~. | | 1 | | | | 

t© 

CO 

*o 

rtr 

| | | | f | 1 1 1 1 1 1 1 


ofr 

I | | i | codoM«W' nHH 1 I 1 1 1 1 1 


68t 

| | | i WifJOS^QOTf (NMrH | 1 | 1 | | 1 

s 

887 

• <■> i *o to o >n 

| | | | loiooo^e^abcor-rl | | | 1 [ I 1 


JL8T 

1 | 1 | fN«oo«ffi6»b w I 1 I 1 I 1 I I 1 

<2 

•© 

98 T 


k© 

8 

981 

1 1 ~«gg.o„=o£« 1 1 1 1 1 1 I i 1 1 

»© 

'ft 

t#T 

Illl 1 1 1 1 1 1 1 1 1 1 1 

£— 

881 

1 1 1 1 II 1 1 1 1 1 1 1 

k© 

8 

8&T 

| | | «, | *•«« || | | || 1 1 1 1 1 1 1 

CO 

T8T 

1 1 1-S2« 1 - 1 1 1 1 1 1 1 1 1 1 1 

00 

087 

uyu^mMimnmn 

o 

P-H 

68T 


© 

887 

Biufciii—niinmi 

to 

187 

11111111111 it 11111111 

I 

9% I 

1 1 1 1 1- 1- 1 1 1 l II M 1 1 M 1 

m 


g g ^ ;; M ^ g J© 

Totals 


j«a^uaQ) g *oxo m ^toedfaf) 



















































TABLE III. Capacity and Internal Height 
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TABLE VII, Length Thicknm and Breadth Thicknm. , 
Difference of External and Internal Lengths in mm. 
























































Difference of External and Internal Heights! 

in mm. ; 
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TABLE XL External Breadth and External Height. 

a* J:l *• 
















































TABLE XiK; Internal Length and Internal Breadth. 
Internal Length in mm. (Central Values). ' 






















































External Length in mm. (Central Values). 
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_External Leagtb m nnra. (Central Vsluge). 
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TABLE XXI. External Diametral Product and Capacity, 


External P, Length x Breadth x Height in cm . 8 (Sub-ranges). 
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TABLE XXII. Internal Diametral Product and Capacity. 


Internal P, Length x Breadth x Height in cm . 8 (Sub-ranges). 
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ON THE DISTRIBUTION OF THE RATIO OF MEAN TO 
STANDARD DEVIATION IN SMALL SAMPLES FROM 
NON-NORMAL UNIVERSES. 

By PAUL R. RIDER, Ph.D., Department of Mathematics, Washington Univer¬ 
sity, St Louis; Sterling Research Fellow (1928—29), Yale University. 

Introduction. 

Let X and s be the mean and the standard deviation respectively of a sample 
of n drawn from a universe having mean M and standard deviation <r. Let at be 
the deviation, X - M, of the mean of a sample from the mean of the universe. 
The ratio z = xjs (or t=z •Jn — 1) plays an extremely important part in a number 
of statistical problems such as determining the probability that the mean of the 
sample does not deviate from the mean of the universe by more than a stipulated 
amount, comparing two mean values, and finding the sampling errors of re¬ 
gression coefficients*. The distribution of z for samples of n from a normal 
universe has been completely determined—originally by “ Student," who applied 
it to the first of the problems just mentioned—and tables of the distribution for 
various values of n have been constructed!. 

The derivation of this distribution is based upon the following assumptions: 

1. That the means of samples of n are normally distributed with standard 
deviation oj'Jn. 

2. That the distribution of standard deviations, s, is given by 

,, . , n <»—1)/2 s n-s g-n»»/2<r» 

f(s)ds> - 

2«»-9)/2 0-n-l f 

in which /(*) ds is the probability, except for infinitesimals of higher order than 
ds, that a value of « will fall within the range (s, s + ds). 

* See R. A. Fisher, “ Applications of'Student's’ distribution," Metron, Vol. v. (1925), pp. 90—104; 
“Statistical Methods for Research Workers,” passim. 

t “Student,” “The probable error of a mean,” Biometrika, Vol. vx. (1908-9), pp. 1—25 (see also 
Tables for Statistician* and Biometricians, pp. xliii—xlv, 36); “Tables for estimating the probability that 
the mean of a unique sample of observations lies between - op and any given distance of the mean of 
the population from whioh the sample is drawn,” Bionettilta, Vol. xi. (1915—17), pp. 414—7 ; “ New tables 
for testing the signifioanoe of observations,” Metron, Vol. v. (1925), pp. 105—8,118 — 20. B. A. Fisher, 
loe. cit.; also “Note on Dr Burnside's reoent paper on Errors of Observation,” Prooeedingi of the Cam¬ 
bridge Philoiophical Society, VoL xxi. (1928), pp. 655—8. 
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3. That x and a are independent. 

While these assumptions have been justified for the case of a normal sampled 
population* * * § , they do not hold otherwise, and we do not know how well “Student's” 
probability integral applies. It has been shown in some interesting experiments 
by Shewhart and Wintersf that it applies—as is to be expected from theory— 
to normal universes, and that for certain types of non-normal universe it gives 
better results than does the normal probability integral (i.e. than assuming z to 
be normally distributed). However, these same experiments indicate that it fails 
to a degree sufficient to warrant further extension of theory. 

The failure seems to be due chiefly to the correlation which always exists 
between x and 8 in samples from a non-normal universe. If the Pearsonian yS's of 
the universe satisfy the relation y8 a - 3 » 0, the regression of the variance, s 2 , 

on x is linear; in other cases the regression is well represented by a second order 
parabola For points in the ySi^-plane lying above the line # a ““£i“3 = 0§, 
the parabola in the ( x t s^-plane is concave upward (that is, its branches are directed 
in the positive sense of the s*-axis), while for points lying below this line the 
parabola is concave downward. (See Fig. 3, p. 132.) For the sake of definiteness, 
let us consider the latter case. For large numerical values of x the value of a 2 
tends to be smaller than its mean value, which denotes that s is smaller on the 
average, and that | z | = | x \/s tends to be larger. As a consequence, a greater 
number of zs would be expected to lie outside a certain value of | z | than in the 
case of a normal universe. On the other hand, for values of x near zero the value 
of a 2 , and also that of a, have a tendency to be larger, causing the values of | z | to 
be smaller. The effect of this would be to make a greater number of z'\ s lie 
inside a certain value than when the sampling is from a normal universe. The 
actual existence of the first effect was verified by the Shewhart-Winters' experi¬ 
ments. Both effects are shown theoretically in the present paper. 

Several types of universe have been investigated, but the rectangular has 
been studied in greatest detail, as it is the simplest from the standpoint of the 
method employed. The rectangular distribution may be regarded as a special 
case of a considerable number of Pearsonian types||; it is sufficiently different 
from the normal distribution to test whether such a difference causes an appreci¬ 
able departure from the theory which is based upon an assumption of normality 

* In addition to the references already cited, see B. A. Fisher, “Frequency distribution of the values 
of the correlation coefficient in samples from an indefinitely large population,” Biometrika , Vol. x. 
(1914—15), pp. 507—21; and Karl Pearson, “On the distribution of the standard deviations of small 
samples,” Biometrika , Vol. x. (1914—15), pp. 522—9. 

t See W. A. Shewhart and F. W. Winters, “Small samples—new experimental results,” Journal of 
the American Statistical Association, Vo), xxm. (1928), pp. 144—58. See also “Sophister,” “Discussion 
of small samples drawn from an infinite skew population,” Biometrika , Vol. xx A . (1928), pp. 889—423. 

X See J. Neyman, “On the correlation of the mean and the variance in samples drawn from an 
‘infinite’ population,” Biometrika , Vol. xvm. (1926), pp. 401—18. 

§ It is assumed that the positive direction is to the right on the ft-axis and upward on the /S^axis. 

|| See T. L. Kelley, Statistical Method , p. 128. 
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in the sampled population; moreover, it is one of the types of universe employed 
by Shewhart and Winters in their experiments. 

Besides the distribution of z , the distributions of various statistical para¬ 
meters (mean, median, range, extreme average, greatest variate, and least variate) 
of samples from a rectangular universe are discussed. It is shown that it makes 
little difference in the distributions of these parameters whether the sampled 
population is continuous or discrete. 


Rectangular Universe. 


If with the quantity Xi (i = 0, 1, ..., m —1) is associated the probability 
1), the chance that a random sample of n X’s will contain n 0 Jf 0 ’s, n x Xx’a, 
..., n m ~j X m ~i& (/i 0 + n x + ... + n w _! = n) is 


n ! 


>! n i ! 


• n 7H~l 


p 0 ri °Pi w ' -.-Pm- 1 *""' 


which is the general term in the multinomial expansion of 

(Po + p!* +Pm-i) n * 

Let us apply this to samples of 4 from an infinite* rectangular distri¬ 
bution of. five classes. Suppose that the variate X may assume the values 
0, 1, 2, 3, 4. Then, since the probability that X will have a specified value is 1/5, 
we see by the foregoing that the probability of getting a sample in which all the 
digits are alike is 1/625; the probability of getting a specific sample in which 
three are alike, such as 0004, is 4/625; the probability of getting a specific 
sample in which two are alike of one kind and two alike of another, e.g. 1122, is 
6/625; of getting a specific sample of two alike and two different, e.g. 1124, 
is 12/625; and of getting a specific sample in which all are different is 24/625. 
Thus we know the probability of obtaining a specified sample, and it is of course 
a simple matter to calculate the value of z = (X — M)/s for the sample. The 
distribution of z y which is symmetric, may be seen in Table I. 


We can now compare our results with those for a normal universe. Perhaps 
the best way to do this is by plotting on probability paperf. This is done in 
Fig. 1. The smooth curve shows the cumulated probability for' random samples 
of 4 from a normal universe}, while the series of irregular steps shows the 
corresponding probability for samples of 4 from the rectangular universe of five 
classes. It is seen that although for the larger numerical values of z the broken 
line lies on the concave side of the curve, it quite often crosses it for smaller values 
of \z\, say |*|< 1. 


* For a description of what is meant by sampling from an infinite population: see A. E. R. Church, 
“On the means and squared standard-deviations of small samples from any population/* Biometrika, 
Vol. xviii. (1926), p. 828. 

t For an explanation of probability paper see G. C. Whipple, Vital Statistics , Chapter xii. 

$ Values were obtained from Tables for Statisticians and Biometricians , p. 86. 
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TABLE I. 

Probability of z for Samples of 4 from Rectangular Universe office Classes. 


z 

(Exact) 

z 

(Decimal) 

625 x 

Probability 
of z 

Cumulated Probability 

for 

for +* 

0 

0 

85 

*5680 

•5680 

n / 51/51 

*1400 

12 

•4320 

•5872 

*/36/35 

•1690 

24 

*4128 

*6256 

^3/9 

•1925 

16 

•3744 

•6512 

^19/19 

•2294 

12 

*3488 

•6704 

VIT/11 

•3015+ 

24 

*3296 

*7088 

1/3 

•3333 

18 

*2912 

•7376 

<J&/6 

•4472 

24 

*2624 

•7760 

3^43/43 

*4575“ 

12 

•2240 

•7952 

v'3/3 

*5774 

28 

•2048 

•8400 

3«/f9/19 

•6882 

12 

•1600 

•8592 

Vfi/3 

•8165“ 

12 

•1408 

•8784 

SVTT/Il 

•9045+ 

12 

•1216 

•8976 

5 V3/9 

•9623 

4 

*1024 

*9040 

1 

1-0000 

12 

•0960 

*9232 

Va 

1-4142 

12 

■0768 

•9424 

6«/ll/ll 

1*5076 

12 

•0576 

•9616 

V3 

1*7321 

8 

•0384 

•9744 

5 v'3/3 

2*8868 

4 

•0256 

•9808 

3 

3’0000 

6 

*0192 

*9904 

7^3/3 

4*0415+ 

4 

•0096 

•9968 

00 

00 

2 

•0032 

1-0000 


The cumulated probability is the sum of all the probabilities from z~ - oo to s* z inclusive. 
It is the probability that the mean of a random sample will not exceed (in algebraic sense) the 
mean of the universe by more than z times the standard deviation of the sample. 

Tables II and III give the probabilities for samples of 3 and 2 respectively 
from the five-class rectangular distribution. They may be compared with the 
tables of “ Studentfor samples from a normal population. 

* “Student,” “Tables for estimating the probability that the mean of a unique sample of observa¬ 
tions lies between - oo and any given distance of the mean of the population from which the sample is 
drawn,” Biometrika , Vol. xi. (1915—17), pp. 414—17. 
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Fig. 1. Cumulated Probability of z . 

The curve is for samples of 4 from a normal universe. 

The steps are for samples of 4 from a rectangular universe of five classes. 


TABLE II. 

Probability of z for Samples of 3 from Rectangular Universe of five Glasses . 


z 

(Exact) 

z 

(Decimal) 

125 x 

Probability 
of z 

Cumulated Probability 

for ~ z 

for -yz 

0 

•00 

19 

•576 

•576 

n / 26/26 

•20 

6 

•424 

•624 

V14 / J4 

•27 

6 

•376 

•672 

V2/4 

• 35 + 

6 

*328 

•720 

'/ 14/7 

•53 

6 

•280 

•768 

V2/2 

•71 

9 

•232 

•840 

\/ 6/2 

1*22 

6 

•160 

•888 

V2 

1*41 

6 

112 

036 

2^2 

2-83 

3 

•064 

•960 

6 V2/2 

3-54 

3 

•040 

•984 

X 

00 

2 

•016 

1-000 
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Probability of z for Samples of 2 from Rectangular 
Universe of five Classes . 


z 

25 x 

Probability 
of z 

Cumulated 

Probability 

-00 

2 

*08 

-3 

2 

*16 

-1 

4 

•32 

-1/3 

2 

•40 

0 

5 

•60 

1/3 

2 

•68 

1 

4 

•84 

3 

2 

•92 

00 

i 

2 

1-00 


A better picture of the situation can be obtained if we employ a rectangular 
universe having ten classes. Suppose then that the variate X can assume the 
values 0,1,2,...»9, with the probability of its equalling a specified digit being 1/10. 
The results are shown in Tables IV and V and in Fig. 2. 



Fig. 2. Cumulated Probability of *. 

The curve is for samples of 4 from a normal universe. 

The small oirdes are for samples of 4 from a rectangular universe of ten classes. 
Biometrika zzx 10 
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Table V and Fig. 2 compare the probabilities, for a ten-class rectangular 
universe and a normal universe, that the deviation of the mean of the sample 
from the mean of the universe, expressed in terms of the standard deviation of 
the sample, will not exceed a given amount. Perhaps the most noticeable pheno¬ 
menon, evident from the table rather than from the graph, is that the cumulated 
probability curves cross for values of z between — 0*8 and — 07, at 0, and again 
between 0*7 and 0*8. This not only proves theoretically the existence of the effect 

TABLE IV. 

Probability of z for Samples of 4 from Rectangular 
Universe of ten Classes. 


z 

Probability 

Below -4*25 

•0077 

— 4 

•0022 

— 3 *5 

•1)026 

-3 

*0032 

-2*5 

•0074 

— 2 

•01H8 

- 1 *5 

•0267 

-1 

*0092 

-0*5 

•2000 

0 

*3244 

0*5 

*2000 

1 

*0092 

1*5 

*0267 

2 

■0188 

2*5 

*0074 

3 

*0032 

3*5 

*0020 

4 

•0022 

Above 4 25 

•0077 


discovered experimentally by Shewhart and Winters*, but also shows the other 
effect anticipated in the Introduction, viz. a greater clustering of values of z about 
the origin. This other effect is not marked, and Fig. 2 would indicate that for 
] z ! < I “ Student’s ” theory, based on the hypothesis of a normal universe, is an 
excellent approximation in the case of a rectangular universe. The ^regularity, for 
large numerical values of z , of the graph of the probability for the rectangular 
universe, is due to the paucity of such valuesf. 

Before taking up other types of universe let us study the regression of 
variance on mean for the rectangular universe of five classes., The scatter diagram 
of variance and mean is shown in Fig. 3, from which it will be seen that the 
second order parabola of regression as given by NeymanJ is in excellent agreement 

* loc . eit. 

t For the effects of the grouping see ‘‘Student,” “The probable error of a mean,” loc . cit . p. 14. 

$ J. Neyman, “ On the correlation of the mean and the variance in samples drawn from an ‘infinite’ 
population,” Biometrika , Vol. xvin, (1926), pp. 401—18. 
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with the actual curve of regression. Table VI exhibits the data from which the 
regression chart was constructed. 


Other Types of Universe. 

The other universes considered are shown in Table VII. Each has but five 
classes. Universe N has its first two y3’s (the moments are uncorrected for grouping) 
identical with those of a normal universe, B is the symmetric binomial (1 +1) 4 , 

TABLE V* 

The Cumulated Probability of z> or Probability that the mean of a Random 
Sample of 4 will not exceed (in algebraic sense) the Mean of the 
Universe by more than z times the Standard Deviation of the Sample . 


2 

Cumulated Probability 
Rectangular Universe+ 

Cumulated Probability 
Normal Universe 


for - z 

for +z 

for - z 

for +* 

0 

*5335 

•5335 

•50000 

*50000 

*1 

*4281 

*5719 

•43676 

•56324 

•2 

•3649 

•6351 

•37595+ 

*62405- 

*3 

•3109 

•0891 

*31962 

•68038 

*4 

■2621 

•7409 

•26912 

*73088 

*5 

*2167 

•7857 

•22509 

•77491 

'6 

*1811 

•8189 

•18755- 

•81246+ 

*7 

•1517 

*8483 

*15606 

•84394 

■8 

•1335 

•8671 

•12995+ 

•87005- 

•9 

*1129 

*8871 

*10846 

*89154 

1*0 

*0905 

•9095 

•09085" 

•90915+ 

ri 

•0853 

•9147 

•07642 

•92358 

1*2 

0737 

•9263 

•06460 

*93540 

1*3 

•0659 

•9341 

•05488 

•94512 

1*4 

•0565 

•9435 

•04688 

•95312 

1*5 

•0537 

•9469 

•04025+ 

•95975“ 

1-6 

•0451 

•9549 

•03475+ 

•96525- 

1 *7 

*0439 

•9561 

*03015“ 

•96985+ 

1-8 

*0383 

*9617 

•02628 

•97372 

1*9 

*0347 

•9653 

•02302 

•97698 

2*0 

•0335 

*9689 

*02026 

*97974 

2*1 

*0271 

•9729 

*01790 

•98210 

2*2 

*0231 

*9769 

•01589 

*98411 

2*3 

•0231 

*9769 

•01415+ 

•98585“ 

2*4 

*0223 

*9777 

•01266 

•98734 

2*5 

0223 

•9783 

•01136 

•98864 

2*6 

•0193 

•9807 

•01022 

•98978 

2*7 

•0169 

•9831 

00923 

•99077 

2*8 

*0167 

•9843 

•00837 

•99163 

2*9 

*0137 

•9863 

•00760 

•99240 

3*0 

*0125 

•9875 

•00692 

•99308 

3*6 

•0105 

•9901 

•00450 + 

*99550- 

4*0 

•0087 

•9919 

•00308 

•99692 


* A more detailed table is given in an Appendix to this paper, 
t Having ten classes. 


10—2 
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or—Mean of Sample - Mean of UniverBe. 

Fig. 8. Regression of Variance on Mean (Rectangular Universe of five Classes). 

The circles, connected by the solid curve, are the means of the oolumns 
of the scatter diagram. 

The crosses, connected by the dashed curve, are points on the second 
order parabola of regression as given by Neyman. 


TABLE VI. 

Regression of Variance on Mean for Samples of 4 from 
Rectangular Universe of five Glasses . 


X 

« * 

*x 

/* 

X 


/« 

0 

1 *8353 

85 


•7875 

m 

± 2b 

1 *76 

80 


*45 


± *5 

1-6324 

68 


•1875 

■Hff 

± *75 

1 4567 

52 


0 

1 

±1 

1-2 

35 





« x *=mean variance of column. 

/* =frequency of variances for a given column. 
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B* is the asymmetric binomial (3 + 2) 4 . Results for samples of 4 from N, B and R 
(rectangular universe of five classes) are compared in Table VIII. The table can 
readily be completed for positive values of z. Table IX gives results for the asym- 
metric binomial universe. 

The grouping in these universes is so coarse that results are unsatisfactory. 
The method, however, can be applied to any type of (discrete) sampled population 
and it is desirable that it be applied to other universes with more classes as it was 
applied to the ten-class rectangular universe. 

An interesting case is that in which the sampled population contains only two 
classes of individuals. For such a population the relation —fti — 1« 0 is satisfied 
and the variance, s 8 , is a definite function of #. For example, suppose the variate X 
can assume the values Xq and X* with the probabilities 1 — p and p respectively. 
Then it can be shown # that s 8 «(Xj - w) (x - X 0 ). 

The probability of obtaining r X/s and (n — r) X 0 s in a sample of n from this 
universe is p r (1 - p) n ~ r . The mean of such a sample is (1/m) [(n -r) X 0 + rZJ, 
and its deviation from Xo+p(Xx — X 0 ), the mean of the universe, is 

x = (r/n — p) (Xi — X 0 ). 

The standard deviation of the sample is s =* (1/n) Vr (n - r) (Xx - X 0 ), whence 

g _ r — np 
Vr (a — r) 

TABLE VII. 


Universe 



ID 

X 

D 

/ 

/ 

0 

1 

1 

81 

1 

20 

4 

216 

2 

54 

6 

216 

3 

20 

4 

96 

4 

1 

1 

16 

Totals 

96 

16 

625 

01 

0 

0 

004 

ft 

3 

2*5 

2*54 


See Neyman, loc. ext . p* 408. The formulae for o -2 in (9) and (10) are incorrect; they sh ould be 

and <r s =(6-5) (i-o)in (10). 
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TABLE VIII. 

Cumulated Probabilities of z for Samples of 4 
from Universes N t B and R . 


z 

N 

B 

R 

— 00 

•0019 

•0039 

•0032 

- 4*04 

•0010 

•0042 

•0096 

-3 

•0019 

•0056 

•0192 

— 2*89 

•0023 

•0096+ 

•0256 

-1*73 

•0*226 

•0333 

•0384 

- 1-51 

*0228 

•0377 

•0576 

— 1 *41 

*0258 

*0553 

•0768 

-1 

*1084 

•1113 

0960 

- *96 

*1084 

*1116 

*1024 

- -90 

•1167 

•1380 

1216 

- *82 

*1167 

•1409 

•1408 

- *69 

•1179 

1526 

1600 

- *r>8 

•2813 

•2386 

•2048 

- *46 

•2813 

•2393 

•2240 

- *45 

•2874 

•2745- 

•2624 

- *33 

•2875- 

•2789 

*2912 

- *30 

■3485+ 

•3503 

•3296 

- *23 

•3568 

•3767 

•3488 

- *19 

*3583 

•3923 

*3744 

- *17 

•3586 

•4011 

•4128 

- *14 

•3586 

•4018 

•4320 

0 

•6414 

•5982 

*5680 


TABLE IX. 

Cumulated Probability of z for Samples of 4 from Universe B. 


* 

Cumulated 

Probability 

z 

Cumulated 

Probability 

z 

Cumulated 

Probability 

— GC 

•0146 

- *07 

•5254 

•78 

*9032 

-3*12 

•0176 

- -06 


•80 

•9144 

-2*20 

0296 

•10 

•5367 

•84 

•9152 

- 1*96 

•0510 

•12 

*5536 

•89 

•9162 

-1 *27 

•0540 

•14 

*5821 

1-04 

•9254 

-1*03 

•0781 

•18 

•6682 

1-06 

*9279 

- -89 

*1351 

•20 

•6583 

1*14 

•9283 

- -85 

•1993 

•25+ 

*6625" 

1*39 

•9340 

- -81 

•2564 

•28 

•6672 

1*40 

•9344 

- *65+ 

•2577 

•33 

•6926 

1-50 

•9598 

- -60 

•2698 

•35- 

*7496 

1*80 

•9767 

- -49 

•2805- 


•7500 

1*98 

•9792 

- *42 

•3447 

•40 

•7669 

1-99 

•9796 

- *35 

*3449 

•44 

•7711 

2*19 

•9797 

- *32 

•3734 

•50+ 

•7787 

2*66 

•9847 

- -27 

•3841 

•54 


3*80 

•9848 

- -21 

•3859 

*57 

•8551 

3*81 

•9851 

~ -20 

•4715- 

•60 

•8692 

4-97 

•9852 

- *12 

•5182 

•70 

•8693 

oo 

1-0000 
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The probability that the mean of a sample will not exceed (in algebraic sense) the 
mean of the universe by more than z times the standard deviation of the sample 
is the incomplete binomial moment of order zero, 

The Distributions of Various Statistical Parameters in Samples 
from a Rectangular Universe. 

Mem. The distribution of means from the five-class rectangular universe is 
shown in Table X. As usual, x is the deviation of the mean of the sample from 
the mean of the universe; p is the probability. It is found that the third differ¬ 
ences are constant, except for discontinuities at x = 0, ± 0*25. Changing the unit 
by the substitution x — of, in order to compare with certain results of Irwin* for 
a continuous rectangular universe, and fitting with cubic curves, we have 
PS3) = f (8/3)(1*02 -0*12 |f |-24^+48|») for f £ 1/4, 

•05 ((16/3) (0*99 - 5*98 | f | + 12f* - 8 j f | 8 ) for f > 1/4. 

(The decimal values are exact.) The value *05 is the class interval. Compare these 
equations with those obtained by Irwin’s method for a continuous rectangular 
uni verse f, viz. 

V_ _ f W3)(1 -24f*+48|f | s ) for | £ | S 1/4, 
df 1(16/3) (1 - 6 | f | + 12f* - 81 f | 3 ) for | f | £ 1/4. 

It should be stated that the values for the probabilities in the continuous universe 
are obtained by integration. For purposes of comparison with the distribution of 
means from a discrete universe they are shown in Table X in the column headed 

fydf. 

TABLE X. 


Distribution of Means of Samples of 4 
from a Rectangular Universe. 


.r 

* 

V 


0 

0 

•1360 

•1327 

+ *25 

± *05 

•1280 

*1257 

+ *5 

±*1 

■1088 

•1075- 

+ -75 

4-*15 

•0832 

•0829 

+ 1 

±*2 

*0560 

•0567 

±1*25 

4*25 

•0320 

0336 

+ 1-5 

±3 

•0160 

•0173 

±1*76 

±*35 

•0064 

•0074 

±2 

±*4 

•0016 

•0023 

±2*26 

±*46 

0 

•0003 

4-2*5 

±•5 

0 

•0000 


p o probability of given value of x (or () for a discrete universe of five classes. 

probability, for a continuous universe, that x (or f) will fall in the given 
class interval. 

* J. O. Irwin, “On the frequency distribution of the means of samples from a population having 
any law of frequency with finite moments, with special reference to Pearson's Type II,” Biometrika , Vol. 
xix. (1927), pp. 226—89. 
f See Irwin, loc. cit. p. 288. 
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TABLE XL 


Distribution of Medians of Samples 
of 3 from Rectangular Universe 
of ten Classes . 


Median 

* 

Probability 

0 

-•45 

•028 

1 

-•35 

•076 

2 

-•25 

•112 

3 

- -15 

•136 

4 

-•05 

•148 

6 

*05 

•148 

6 

•15 

•136 

7 

•25 

•112 

8 

•35 

•076 

9 

•45 

•028 


TABLE XII. 

Distribution of Ranges of Samples of 
4 from a Rectangular Universe . 


Range W 

V 

l<h(W)dW 

0 

•0010 

•0004Jj} 

1 

•0126 

•0115 

2 

•0400 

•0388 

3 

•0770 

•0757 

4 

•1164 

•116 

5 

•1510 

•1495 

6 

•1736 

•172 

7 

•1770 

•1753 

8 

•1540 

•1522 

9 

•0974 

•0955 

10 

— 

•0140ft 


p -- probability of given rango for discrete 
universe of ten classes. 
f<f> 2 (JP) rf probability, for a continuous 
universe, that range will fall in the 
given class interval. 


The distribution of means of samples of 3 for the discrete universe is fitted by 
the quadratic 

P(J) = f (9/4) (1-019 - 12£ 2 ) for | £ | < 1/6, 

1/15 ((27/8) (0*995 - 4 ] £ | + 4£») for | £ | > 1/6. 

The variable £ has the values 0, ±1/15, ±2/15, ±3/15, ±4/15, ±5/15, ±6/15. 
Irwin’s results for samples of 3 from a continuous universe are 

y f (9/4)(l-12£*)for|£|Sl/6, 
d£ 1(27/8) (1 - 41 £ | + 4£») for | £ | S1/6. 

For samples of 2 the result i9 the same for discrete and continuous universes, viz. 

P (?)/0’l = y/<J£ - 2(1 — 2|£|). 

It is worthy of note that the discontinuities in the polynomial representations 
occur at the same points in the discrete universe and in the continuous universe. 

Median *. Suppose we have a universe in which the variate takes values 
between 0 and 1, all such values being equally probable. Take a point X in the 
interval (0, 1). The probability that the variate will lie in the interval (0, X) is X 
and in ( X , 1) is 1 — X Consider 2& +1 values of the variate, X lf X t , X&+i. 
The probability that one of these values will fall in the interval (X, X + dX) is 


* Cf. E. S. Pearson and N. E. Adyanthaya, “The distribution of frequency constants in small 
samples from symmetrical populations, ” Biometrika , Vol. xx A . (1928), pp. 866—60. 





Paul R. Rider 


137 


(2fc + l)dX. Tiie probability that k of the remaining values will lie in (0, X) and 

^ ) X k (1 — X) 4 . Consequently the probability that the 
median will fall in the interval (X, X + dX) is 

(2* +1) X 4 (1 - X) 4 dX = X* (1 - X) 4 dX. 

Let us shift the origin to the centre of the interval by letting X = f +1 /2. Then 
X 4 (l-X) 4 -(1/4-£*)*> and 

is the probability that the median will lie in the interval (£, £ -f <2£). 

For samples of 3 from the discrete rectangular universe of ten classes the 
medians are distributed as shown in Table XI. The distribution is perfectly fitted 
by the parabola 

p (£)/0*l = 1*495 — 6£*. 

If 2k + 1 = 3, then k = 1, and the distribution of medians of samples from the 
continuous rectangular universe is given by 

1-5 - 6£®. 

Range * The probability distribution of ranges for samples of 4 from the 
rectangular distribution of ten classes is shown in Table XII in the column headed p . 
If the first two groups (range 0 and range 1) are combined, the distribution is 
perfectly fitted by the cubic 

p ( W)/ 0*001 = -12 W* + 120 W 1 - 2 W + 20, 
in which W = range. 

Table XII also gives, for comparison, the probability distribution of ranges for 
samples of 4 from a continuous rectangular universe. This has been determined 
by Neyman and E. S. Pearsonf. 

Their formula (xxxviii) on page 210 is 

<f> 2 (W)=*n(n-l)(W n ~ 2 /w n ~ l ) (1 - W/w), 

in which TF is the range of the sample and w the range of the universe. In our 
case n « 4, w = 10, and for fa ( W) d W we get 

fa(W)dW=* 0*012IF*(1 — 0*1 W)dW. 

By a simple integration we find the probability of each class. 

* Of. E. 8. Pearson and N< K. Adyanthaya, “The distribution of frequency constants in small 
samples from symmetrical populations/’ Biometriha , Vol. xxK (1928), pp. 866—60; also the article 
by Neyman and E. S. Pearson referred to in the next footnote. 

t J. Neyman and E. 8. Pearson, “On the use and interpretation of oertain test criteria for purposes 
of statistical inference,” Biometriha, Vol. xxA. (1928), pp. 176—240, 268— 94. 
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The substitution 10£ enables us more readily to compare the two equations. 
In the notation employed for means we have 

p (£)/o-l = - 12£ 3 + 12f* - 0*02f + 0 02, 
y/i£«-12£*+12£» 

Extreme Average . By extreme average is meant the mean value of the greatest 
and least variates in a sample. The distribution of extreme averages for a sample 
of 4 from the rectangular universe of five classes is shown in Table XIII. If E is 
the value of the extreme average, then the substitution 2?*5£ + 2 reduces the 
range of the extreme average, which is the same as that of the universe, to a unit 
interval centred on j£=2. The distribution is then fitted exactly by the cubic 

p (£)/0*l = - 32 | £ | !3 + 48£* - 23-84 | £ | + 3*92. 

Compare this with the equation 

/(?>/#»-82|£|* + 48£*-24|£| + 4 

for the distribution of the extreme averages of samples of 4 from a continuous 
rectangular universe, which may be deduced from a formula given by Dodd* 

TABLE XIII. 


Distribution of Extreme Averages for Samples of 4 
from Rectangular Universe of five Glasses. 


Extreme 
Average E 

£ 

625 x 

Probability 

Probability 

V 

0 

-•4 

1 

*0016 

•5 

— *3 

14 

•0224 

1 

-•2 

r>i 

*0816 

1*5 

- *1 

124 

•1984 

2 

0 

245 

*3920 

2-5 

•1 

124 

•1984 

3 

•2 

51 

•0816 

3-5 

•3 

14 

*0224 

4 

■4 

1 

•0016 


Greatest Variate and Least Variate . The distribution of greatest variate and 
that of least variate can be obtained from the correlation table of these two para¬ 
meters (Table XIV). Tho entire table is for a ten-class universe, the part set off 
by dotted lines is for a five-class universe. 

The regression surface is fitted (except for the diagonal of l’s) by the equation 
/(£, 7/) »12 (£—+ 2. If the table be made symmetric after tho manner described 
by Tippettf, the lack of fit along the diagonal will be overcome. 

* £. L. Dodd, “Functions of measurement under general laws of error,’* SkandinavUk Aktuarie- 
tidskrifu Vol. v. (1022), pp. 133 — 58 . The formula for a sample of n from a continuous rectangular 
universe of range unity is /(£)/d£=2 n “ 1 n (1/2 -1 {| ) n ~ 1 . 
t Biometrika , Vol. xvn. p. 881 . 
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Simultaneous Frequency Distribution of Greatest and Least Variates 
{Rectangular Universe). 


Least Variate. 



The entire table is for a ten-class universe, the part set off by dotted lines is for a five-class 
universe. 


General Conclusions, A number of statistical parameters (mean, median, range, 
extreme average, greatest variate and least variate) in samples from a rectangular 
universe are distributed in polynomials, which are apparently, except in the case of 
greatest variate and least variate, of degree one less than the number in the sample. 

There appears to be little difference in the distributions of these variates when 
the universe is continuous and when it is discrete. 

Geometric Methods. 

The distribution of means of samples from a continuous rectangular universe 
has been derived by Hall 11 by considering a sample of n as a point in space of 
n dimensions. The sampled population is represented by a unit hypercube. (See 
Fig. 4.) Let P be the point representing the sample (X lf X 2 , ..., X n ); let 01 
be the diagonal of the hypercube from the origin to the point /, each of whose 
coordinates is unity; and let PM be the perpendicular from P upon 01, Then 
the mean, X , and the standard deviation, s , of the sample are OM/OI and MP/OI 
respectively. Hall's method consists of ingeniously finding the content ofjthe 
(n — l)-dimensional region cut from the hypercube by the hyperplane %Xi~nX. 

* Philip Hall, “ The distribution of means of samples of size N drawn from a population in which 
the variate takes values between 0 and 1, all suoh values being equally probable,” Biometrika , Vol. xix. 
(1927), pp. 240—4. 
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In an Editorial Note* on the paper of Hall (and the paper of Irwin referred 
to above), E. S. Pearson has called attention to the fact that the frequency of a 
given value of the standard deviation of a sample could be obtained by integrating 
the density (unity for a rectangular universe) throughout the region for which MP 
is constant, which is a hypercylinder with axis 01 and cross-section an (w — 1)- 
dimensional hypersphere. 

i 


Fig. 4. 

Since the mean of the rectangular universe may be represented by O', the 
centre of the hypersphere, the deviation of the mean of the sample from the mean 
of the universe may be represented by O'M/OI. Thus the ratio z is represented by 
O'M/MP = cot a, where a= ZMO'P. The region for which z is constant may be 
described as a hypercone with vertex O' and axis coinciding with OJ, and the 
frequency distribution of z could be obtained if we could find the content of that 
part of the “surface” of this hypercone bounded by the hypercube f. An endeavour 
to do this seems to lead to extremely complicated integration even in the case of 
three dimensions, but the distribution of z, and also that of 8, the standard devia¬ 
tion, are derived for n = 2 in the hope that they may offer some clue to the forms 
of the distributions for larger values of n. 

For 2 (i.e. samples of 2) the hypercube degenerates into a square (see 
Fig. 5), and it is only necessary to find the length of the line AB for the 
frequency of s, and the length of the line O'C for the frequency of z . 

B I 


C 
A 

0 

Fig. 5. 

# Biometrika , Vol. xxx. (1927), pp. 244—6. 
t See Neyman and E. S. Pearson, loe. cit . 
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We find that AB = 01- 2MP = V2 (1 - 2«). 

The area of the unit aquare is 

. tilt _ rut 

2 I ABd (MP) = I 2V2(l-2a)V2ds= 4(1-2 s)ds. 

J MP» 0 J J 0 

Therefore the distribution of standard deviations of samples of 2 from a continuous 
rectangular universe is 

/(s)ds»4(l-2s)d*. 

Applying the law of sines to the triangle 00'C> we find that 

O'C = sin (tt /4) = _I/V 2 _ 

1/V2 sin (37r/4 — a) sin (37t/4) cos a — cos (37r/4) sin a ’ 
whence O'C = £cosec (a + 7t/4). 

The area of the unit square is 

4 f £ (O'C) 2 da * 2 f £ cosoc 2 (a + tt/4) da. 

Jo Jo 

The distribution of a is given by 

♦<•>*-* + * - 2 (»io « + ««Tj> ' 

and the distribution of z (*= - cot a) is 

/(*) («) d « s 2(i-]7ff ■ 

The cumulated probability of s is 


1 

2(1 +*) 


for £ ^ 0, 


2(1 — z) 


for z £ 0. 


It will be found that these distributions of s and z for samples of 2 from a 
continuous rectangular universe give very good fits to the corresponding distribu¬ 
tions for a discrete rectangularmniverse. 
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APPENDIX. DETAILED TABLE V. (See p. 131,) 


Probability Distribution of z (Mean of Sample —Mean of Universe) + (Standard 
Deviation of Sample) for Samples of 4 from Rectangular Universe having 
ten Glasses . 


(/ = 10,000 x probability of z) 
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APPENDIX. DETAILED TABLE V—Continued. (See p. 181.) 
(/= 10,000 x probability of z) 
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ALBINISM IN DOGS 


By KARL PEARSON and C. H. USHER. 

The long research on the experimental breeding of dogs initiated in 1905 by the 
late Edward Nettleship, and the authors of the present memoir*, and still carried 
on in connection with the Qalton Laboratory, is not yet ripe for complete publication. 
It has been protracted first on account of the heavy expense of keeping at the same 
time a large number of dogs, and secondly owing to the grave difficulties of the 
Great War, during which only a few dogs could be preserved for starting afresh 
when times permitted. This involved still closer inbreeding than we should have other¬ 
wise employed. The experiment has gone so far, however, that it seems possible to 
report on one aspect of the investigation, that of albinism in dogs. Our original 
stock consisted of Albino Pekinese and pure-bred black Pomeranians. These albino 
Pekinese had white coats, the hairs of which contained no pigment granules 
whatever; the eyes of all of them were characterised by strong red reflex, and 
marked photophobia. Indeed, on more than one occasion too long exposure to 
brilliant sunshine has produced something of the nature of a collapse in the dog, 
who has had to be carried home. The sight of these albinos has varied a good deal, 
some being so short-sighted that they were apt to run against obstacles, and many, 
if not all, found difficulty in catching pieces of biscuit thrown to them, which was 
accomplished promptly by their coloured brethren. This type of albino we shall 
term the Dondo or White Albino, it is the commonest of all our albinos. 

It has not been found possible, in the course of our twenty years’ experiments, 
to obtain dogs having albinotic eyes, and hairs of the coat which contain melanine 
pigment granules. Even light fawn coats, the hairs of which contain very few 
pigment granules, cannot be associated with albinotic eyes. On the other hand, we 
have on a few occasions obtained dogs with white coats, and fully pigmented eyes; but 
even in some of these few cases the tendency of the coat was to become pale yellow 
as the dog approached maturityf. No pigment granules are to be found in these 
yellow hairs, they appear to be a stage towards the paler fawns in which granules 
are scarcely to be found except of course in the black “ points,” if such are present. 

But there is one coat which has all the appearance of “some" colour, although it 
lacks entirely pigment granules, with which albinotic eyes can be associated. This 
colour may be described as something like pale millboard, although it has in certain 
aspects a tint not unlike the sky background of many photographic prints. Weldon 

* I gladly acknowledge the help that my share of the work has received from occasional grants from 
the Royal Sooiety Government Grant Committee. K.P. 

t The dogs with blaok eyes and white coats have in our experience been fonnd to be very delicate, 
the oritioal time being when the puppy-ooat is oast. 
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termed a similar skin colour iu his mice “ lilac” or pale blue grey and found it 
possible to associate it with albinotic eyes* This “lilac” shades off into pale 
chocolate, such as one reaches by biting off the end of a stick of that delicaey, and 
led' one of us to describe the dogs Hans and Grethel when puppies as of a scraped 
chocolate colour (see Plate V). I do not think any distinction can be made between 
the “lilac” and “scraped chocolate” ooats; it depends, in part, on the lighting by 
which they are seen, but more on the length of time since the last moult. The 
colouring is not due to pigment granules, but is of a diffused character, and possibly 
is due to lipochromes; it is like in nature to that of certain rather rare types of 
human red hair, and of socne chestnut hair in horses, which carry no pigment 
granules. This coat colour differs essentially from the cream or faint yellow tinge 
which Dondos acquire when much in the sun and lose as rapidly in winter or in¬ 
doors; a similar colouring occurs with the hair of human albinos under like conditions. 
All the dogs we have bred with the “lilac” coat have had albinotic eyes. They form 
the second type of albino dog and we have termed them Cornaz Spaniels, in memory 
of the distinguished Belgian ophthalmologist. The Cornaz Spaniel has a whole 
coloured coat, no dark “points” of any kindf, and the pink nose pad and flesh of the 
pure white Dondo. Both Dondo and Cornaz have, as a rule, “spectacle marks” of a 
brown colour. These are formed by oxudation from the lachrymal glands, and this is 
caused,possibly,by some weakness of the eyes. The “spectacle marks” stain the finger 
when touched, andean be washed off the coat; they increase with exposure to sunlight, 
and will practically disappear in the case of a bitch shut up with her litter. They 
are due to some oily form of exuded pigment, much like what is discharged from 
the pores between the toes of these same albinos. A study of this exuded pigment 
would be of interest, but it has nothing to do with the absence of melanine pigment 
granules, which constitutes the essential feature of albinism of the coat. 

We now reach the remaining canine albinotic type; it would be called a piebald, 
if the colouring were black and white. Actually the colouring is “Cornaz” and 
white, and we prefer to call it a skewbald J (see Plate VI). The reader must be 
warned at once that a skewbald albino is not the hybrid of a Cornaz and a Dondo. 
The Cornaz white skewbald has, in our experience, invariably albinotic eyes, and 
it appears in a litter, which, whether albinotic or coloured in its members, has other 
members piebald or skewbald. One of the remarkable facts about a skewbald albino 
is this: Two puppies axe born in the same litter and there is nothing whatever 
to distinguish between them; both are markedly skewbald albinos. The first 
moult comes, and one of these puppies will lose its “Cornaz” coloured areas, 
cease to be skewbald and remain during life a pure white Dondo; the other 
remains a skewbald after its first moult, and if it does so is a skewbald for 
ever. We have no explanation to offer of this phenomenon, it is an extreme case of 

* Biometrih Ira, Vol. xi. Appendix, p. 45 and Records, pp. 2 et seq . 

t By 44 points” we mean a Barker shade on ear8, head, vertebral column, etc. A Cornaz may some¬ 
times have a white shirt front, or a white belly. 

X By 44 piebald*’ or “skewbald,” we do not understand the existence merely of a white shirt front, 
white belly or white paws, but parti-oolouring stretching up to or even across the vertebral column. 

xxi 11 
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the fact that it is not feasible to describe accurately the coat colour of a dog till 
after the first moult*. We have now to consider these three types, (i) the Dondo, 
or pure white albino, (ii) tho Cornaz Spaniel, or “ lilac " albino, and (iii) the Skew¬ 
bald Albino, and to ask whether they may be looked upon as true breeding distinct 
types, or whether one can arise from the other; further, whether any of them can 
originate dogs with normal eyes, or pigmented coats. 

Let us start from our foundation stock, Tong (called Tong I later), Jack and 
Jill (see Plate I). These three were all white Pekinese albinos or dondos, with a 
tendency for the coat to become “creamy” in parts when it was old, or the dog 
much in the sun. Tong had two litters by Jack (not reproduced entirely in the 
pedigree which accompanies this paper). In the first she had five all white albinos, 
of whom: Tong II has only had coloured litters by coloured dogs but albinism 
has reappeared in her grandchildren; Mairi bhan was mated with her brother 
only; Ian ban mated with a black Pomeranian bitch Olga produced a black dog 
Donald dhu shown in our Pedigree I, and various black dogs with white shirt 
fronts; Ian ban mated with his sister Mairi bhan sired a litter of six albinos all 
male; these were all white albinos with the usual tendency to cream seasonally; 
of this litter Dngald ban mated with a black dog with white shirt front, offspring 
of Ian ban and Olga the black Pomeranian, gave among coloured puppies a white 
albino, Donald ban (“Idiot”), who sired for an albino Pekinese Spook a litter of five 
Dondos or whito albinos, the third dog Hamish ban of the three survivors was not 
mated. The second litter of Tong I by Jack consisted of two male albinos Wee 
Ling and Wee Choo , of whom no offspring, and Wee Tang a female (see the pedigree), 
all white albinos. Thus far the record of breeding albinos from Jack and Tong shows 
nothing at all inconsistent with white albinism being a recessive character in the 
Mendelian sense. 

We now turn to the matings of Jack with his sister Jill . These two, dog and 
bitch, produced five litters; there was nothing to distinguish Jack from Jill in coat 
colour, they were white albinos with the usual creamy appearance. They have both, 
of course, the same ancestry, entirely of coloured dogs, which does not link up with 
that of Tong I until we come to Ah Gum , who was imported from the Pekin Palace 
in 1896 (see Pedigree I), and may now be seen in the British Museum (Natural 
History). His coat in life was chostnut or bright sable red. We'naturally expected 
from Jack and Jill a repetition of the same results as from Jack and Tong /. The 
first litter, however, gave two albino males of which one who died at birth was said 
to be darkish down the back, and the second Wang (see pedigree) was our first 
Cornaz albino. Nettleship described the coat as a “rather dark cream,” but Pearson 
was very familiar with the dog as he came to him repeatedly for mating purposes 
and afterwards into his ownership; the colour of the coat was the “lilac” of Weldon s 
albino mice. Wang was the first Cornaz bred by us. Mated with Send dhu , a black 

* One is often obliged to describe the coat-colour of a puppy dying at birth or shortly after. This is 
done by a knowledge of what different puppy-coats normally become after first moult. An inexperienced 
observer wUl report a litter of three 11 black ” puppies, whioh one knows full well will be red brindles after 
the first moult. 
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dog with a white shirt front, the offspring of Wee Tong , a pure Pekinese white 
albino, and Donald dhu , a Pompek (see pedigree), he sired two puppies Hans and 
Orethel, our most characteristic Cornaz Spaniels; they both had the “bruised” or 
“scraped” chocolate coats, so characteristic of this sub-species of albino (see 
Plate V). 

The second litter of Jack and Jill was even more convincing in showing that 
in dogs white albino x white albino does not always breed true. The offspring were 
five albinos as far as the eyes were concerned; the first two, bitches (one unnamed and 
Fi) t had white coats, slightly tinged with cream, the next two Fe and Fo (see Plate II) 
were dogs and both skewbald albinos, that is to say, distinctly parti-coloured, 
the two colours being the white of white albino, and the grey or pale buff of the 
Cornaz; the fifth dog Fum was a Cornaz, slightly darker than his brother Wang. 
This second litter of Jack and Jill showed that dogs rated as white albinos would 
not necessarily breed true, but could produce both skewbald albinos and Cornaz 
albinos. Jill's third litter by Jack contained an albino which died at birth, probably 
but not certainly a white albino, and Patty (see Plate I) who was distinctly skew¬ 
bald at birth, but ultimately the coloured patches became whiter than the coat of 
either of her parents at 5 months old; at a year she must be called a white albino. Jill's 
fourth litter by Jack contained three dogs and a bitch, the first dog Fo /J, a skew¬ 
bald Cornaz at birth, became ultimately a pure white albino, the second Lo would 
be best described as a Cornaz with white markings, the third dog was a white albino 
and the fourth, the bitch, would probably have been a Cornaz. Jill's fifth litter 
consisted of four puppies, three bitches and a dog; the first Peggy an albino, “light 
coat with some pure white,” was imperfectly described by Nettleship; the next two 
bitches died at seven days and three weeks respectively of age, and Nettleship 
described them simply as albinos; the fourth, a male, was probably a Cornaz 
without white shirt front. 

The Jack and Jill matings gave litters which were unfortunate in the matter 
of survival*, probably it was due to the close inbreeding. Only two dogs Wang and 
Patty (see Plate I)—possibly Peggy who was given away at eight weeks and lost 
sight of—survived to maturity; the former pair were well known to one of us; they 
were respectively a Cornaz find a white albino. There is no doubt therefore that 
white albino mated with white albino can produce all three types of albinos. 

Now let us return to Jack . We mated him with another white albino bitch Mor 
bhan extracted from Donald dhu , the Pompek, and Wee Tong y the pure white albino 
Pekinese. The litter consisted of five white albino or dondo puppies (see Plate IV), 
all developing into the customary white adults with the usual cream tendency. 
Accordingly mating Jack with an extracted albino did not seem to give any greater 
chance, than when he was mated with Tong /, of a variation of the usual rule of 
white albino x white albino giving pure white albinos. Of the litter in question 

* The descriptions were also not as complete as was desirable. Nettleship was an ardent Mendelian; 
he was distressed and perplexed by these litters. Unfortunately he gave away as puppies, dogs which it 
was of the utmost importance to preserve. We have only a few photographs of these litters, a painting 
of Fe and one or two skins. 

11—2 
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Willie and Rob were very fine specimens of white albino dogs or dondos, Lassie and 
Meg bhan quite fair dondo bitches, the fifth, an albino dog, Tam, died at four months, 
from the effects of a kick from a pony. Meg bhan had seven fertile matingB. The 
first was with Larry, the produce of Wee Tong and Donnach ruadh, Wee Tong’s son, 
a red dog with black points, Larry being a dondo; this litter consisted of five of the 
usual type of white tending to cream albinos, Banshee , Hob, Imp, Pixie and Nixie, 
no Comaz in the litter. Her next litter was Bired by Ham the Cornaz albino and 
produced a bitch, who was Comaz with a white belly, and was nosed out of the nest, 
Bube , a Cornaz, of uniform “ scraped” chocolate coat, and with a very small white 
shirt front (when last seen his colour was lighter and resembled that of Wang), a bitch 
Amie, a white albino, still of a creamy tinge, and a dog Mendel , a pure white albino. 
Thus we see that a pure white albino, who was not descended from Jill, but who 
was descended from Jack through Wee Tung, could produce, mated with a Cornaz, 
both white and Cornaz albinos. For her third litter Meg was mated with the red 
dog Donnach ruadh and she had only albinos, two Comaz dogs and one dondo bitch 
Morse . In her fourth litter Meg had for mate the dondo Imp , and there resulted a 
dondo bitch that died almost at birth. For the fifth and sixth litters her mate was 
Bruno a light red dog with black markings, the son of the Cornaz albino Hans and of 
Norah * a brindle red bitch with black points. These two matings produced eleven 
dogs; one light red brindle male, a dark brindle male ( Jof ), a dark brindle female 
{Pet), a black male (belly a very dark brindle) with one white paw and brindle 
bracelets on forepaws who died at once, and another brindle male (Woodrow); the 
remaining six, albinos, consisted of three white albinos Kit, Haig and Komi all 
sufficiently “creamy” to show something like white shirt fronts, and two dondo 
bitches without sufficient cream to show white shirt fronts. Thus the mating of a 
dondo bitch with a red dog of albino descent from Jack did not provide us with any 
Cornaz albinos. 

Lastly, Meg was again mated with the Cornaz Hans ; she bred only two puppies, 
one, a male, died at once, it showed a white shirt front on a creamy coat, but would 
probably have become a white albino; the second, a dondo Ben , who has been one 
of our best pure white albinos. Meg shortly afterwards had an abortive litter, 
became sickly and had to be destroyed. 

We may now turn back to the more closely Cornaz matings. Wang and the black 
bitch Seni dhu besides giving rise to the Cornaz albinos Hans and Orethel , had in 
1913 as offspring the bitch Trine , another Cornaz and two other Cornaz puppies. 
Hans, the Comaz, mated with Wee Tong, the pure Pekinese dondo, gave rise to a 
pure white albino Hun (1915). Hun was mated with the Comaz bitch Liese, and 
the latter’s litter consisted of two Cornaz bitches, born in 1916, Hinne and Henne, 
both of whom had white shirt fronts. Meanwhile Trine mated with Bube gave 
birth to two Cornaz puppies, one of which died early unnamed, and the other was 
Pontina (“ Beb6 ”)f. 

* Norah was the daughter of Donnach ruadh and his litter sister Giortal ruadh, the first red dogs to 
reappear after the mating of dondoe and Pomeranians. 

t She was a whole colour Cornaz, exoept for a white shirt front. 
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It might be thought in view of these results that it would be easy to establish 
a true breed of Comaz Spaniels, but we have not really succeeded in doing so, and 
this for two reasons: first, the greater number of our Comaz spaniels were born 
during the war, when it was impossible for us to keep more than a few dogs, and 
secondly, the Cornaz bitches have proved less healthy and less fertile than even the 
dondo bitches. At present we have not a single pure Comaz in the kennels; and, 
perhaps, the only hope of recovering them lies in the bitches Jade and Juby t now 
growing old, to whom we shall return shortly. 

The reader must allow us to pass now for a little into the fringe of our 
stock of coloured dogs. Wee Tong , the pure Pekinese dondo, mated with her son 
Donnach ruadh gave rise in 1913 to a litter of which one member was an extremely 
fine red dog with black points, Patsy. Meanwhile the black Pompek, Donald dhu, 
mated with his daughter by Wee Tong , namely Mor dhu , a black dog with white 
points, gave rise to a red bitch with black points, Siri (see Pedigree I). Patsy and 
Siri helped largely to maintain our stud during the war years and to renew it 
afterwards. In 1920 they had a litter which contained among other coloured dogs: 
Siu Niu (“Topsy”), a fawn bitch with black points and Ghangpie , a grey brindle. 
In the litter of Patsy and Siri of the following year appeared Maureen , red with 
black points, a veritable Brunhilda, a very strong, fierce and bad-tempered bitch. 
With considerable difficulty and even danger to her spouse, she was mated with her 
brother Ghangpie and the litter consisted of two dogs, a white albino of the usual 
type, Mac , and a red dog with black points, Jamie . Meanwhile the dondo Ben had 
been mated twice with Topsy ; in 1922 there was a litter of three red puppies, of 
whom we are only concerned with the dog Eld , red with black points and the bitch 
Babs } red with white shirt front. The latter was a small dog like her mother. In 
1923 in a litter of two, appeared the bitch Setie , a fawn-red with white shirt front. 
For-sizc and length of leg she takes after her maternal aunt Maureen , but psychically 
is of the very opposite temperament. Her litters have been of great interest. Thrice 
mated with Eld, a litter brother of Babs, and therefore Setie’s full, although not 
litter, brother, she has given rise to white dogs with pigmented eyes. These, how¬ 
ever, do not largely concern us at present. Setie in 1926 was mated with Mac and 
had a son Wu, a typical white albino; there is nothing in coat or eyes to distin¬ 
guish Wu from any other of the dondos produced, since the original cross with the 
black Pomeranians*. Returning to Bobs , she was mated in 1923 with her own 
father, Ghangpie , or we mated a red bitch with a grey brindle; the result was one 
of the most remarkable litters we have had. It consisted of three puppies, one 
dog Stockie and two bitches Jade and Juby. All three puppies at birth and till the 
change of puppy-coat were markedly and undeniably skewbald. Stoche is white 
with grey-brindle patches as dark as his father's, Changpie's coat. He is in no 
respect an albino, but the skewbaldism extends to his eyes and he has hetero¬ 
chromia of the irides. A fuller account of him will be provided on another occasion. 
Meanwhile in Jade and Juby we had reached again our skewbald Cornaz dogs, but 

* They are larger dogs than the pure-bred Pekinese albinos, and not as graceful as these, or indeed 
as the Cornaz Spaniels. 
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no longer as product of two albinos, but of two coloured dogs. Unfortunately Jade 
lost her skewbaldism at first moult. She would now be described as an ordinary 
white albino or dondo bitch. Juby, on the other hand, is a typical skewbald albino, 
who has retained her “Cornaz” patches for six years. As we may naturally antici¬ 
pate, Jade and Juby, being bitches with the Cornaz coat-tending and closely 
inbred, are neither concupiscent nor prolific. But Juby , the skewbald albino, has 
been mated with the white albino Wu, the son of her half-brother Mac. Both 
Juby and Wu have the usual albinotic eyes, in both cases with marked red 
reflex. Juby only differs from her sister Jade , in that the latter has lost, while the 
former has retained, during life, the skewbald patches of Cornaz colour. Now by 
mating two dondos together, or mating two Cornaz spaniels with full Cornaz colour, 
or by mating a dondo with a Cornaz spaniel, we have never got a coloured dog in 
the litters, but by the mating of Wu and Juby , dondo with skewbald Cornaz, a 
remarkable litter has resultod wherein all the puppies arc coloured . The first, a 
male, Schwarzert , has black body colour with a white paw and certain tan markings, 
the second, a male, Brunert, has red body colour with black markings, while the 
third, the bitch Oelberta , has a somewhat brighter red for body colour and black 
points. The sire, dam and offspring as puppies are represented in the accompanying 
Plate VIII from the standpoint of colour rather than of form. Juby has had, through 
her mother, a very varied and continuous flow of albinotic ancestors; Ckangpie's 
albinotic ancestry is further removed and less intense, it goes back to Jack and 
Tong , but not to Jill . It cannot, however, be said that Juby has more coloured 
ancestry than Jack, Jill or Tong, nor can her coat be described in other terms 
than that of the skewbald albinos Fe and Fo . Our Plate VII gives her photographs 
together with two of Wu. She is just what we have described her to be, a skew¬ 
bald Cornaz. 

Before we enter a little more fully into the question of Cornaz pigmentation, 
we should like to remark on three points. Our first point is that the excessive in- 
breeding, which we have been driven to by the Great War, and the great cost of 
maintaining a canine stud of adequate numbers, does appear to increase the feeble¬ 
ness of the dogs and emphasise their infertility and the heavy mortality in the 
litters, especially of the albinotic dogs. Secondly, this very inbreeding appears to 
increase the interest of the progeny; the closer the inbreeding the more likely we 
seem to get interesting results, e.g. Jack and Jill, Hans and Orethel , Setie and Eld, 
Babs and Changpie , Juby and Wu. Rare factors which would not otherwise be re¬ 
enforced are thus brought to light. Thirdly, but this is only a surmise, which 
cannot be adequately maintained from the evidence adduced in this paper, the first 
litter of a pair of dogs is more likely to produce varieties than later litters. On 
this and other grounds*, which have come to light in dog-breeding experiments, 

* One we may mention here: the spermatogonia form a community of living organisms; it is con- 
trary to experience that suoh a community should not exhibit individuality among its members. 
JenningB believed he had reached such a state of affairs in his Paramecium experiments, bat his further 
researches on rhizopods, where he had a wider range of observable characters, oonvinoed him, as the 
observations of Johannsen on Phateolus vulgaris and Hanel on Hydra grisea (see Biometrika , Vol. u. 
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we are not yet prepared to accept an atom-like identity in the spermatogonia of an 
organism, i.e. that an identical factorial formula is all they contribute to inherit¬ 
ance. We have been impressed with the need of supposing that there is a certain 
amount of individuality in the ultimate source of the germ cells, and that during 
the life of an organism these individualities may be present at different periods in 
different proportions, and this population of cells be even subject to some form of 
selection before the development of the spermatozoa, and the spermatozoa be again 
intensely selected before they reach individual ova, which in their turn have passed 
through various stages of selection. 

Whether piebaldism be determined by a unit factor or not, we know that the 
extent of white in the coat is a continuous character inherited as stature or cubit 
in man. Those who have been in close touch with many dogs will, wo think, agree 
with us when we say no two body colours appear exactly alike, you may look 
through dozens of skins and find individuality in the shade of them all. The 
classification into reds, fawns, red brindles, grey brindles, and so forth is broad and 
superficial, there is really individuality in every case, and what is more, that indi¬ 
viduality is not a “ fluctuating variation there is less difference between the 
“red,” say, individualities in brethren than between the “reds” of cousins. Just 
as the various types of chestnut in hackneys are individually inherited, and a 
single factor for chestnut only serves to screen this, so it is with coat colour in 
dogs. 

We preface our remarks on albinism in dogs with the above statement, because 
there are undoubtedly grades in the tint of Cornaz Spaniels. It would be impossible 
to confuse the coats of Hans and Ben or of Orethel and Wee Tong , but within the 
albinotic type the intensity of the Cornaz colour varies, and we have seen that in 
skewbald puppies like Patty and Jade, it may disappear, and as adults they may 
be merely white albinos or dondos. Even a Dondo may show at times considerable 
“creamy” areas, though we do not think such tints can be confused with the 
“ scraped ” chocolate of Hans or Orethel. Further the dog, who would be distinctly 
classed as a Cornaz Spaniel, may not be whole colour like Bube, but have a white 
shirt front like Hinne or Henne , which is far from being the full skewbaldism of 
Fe or July. If, as in the case of the early writers on albinism, we look upon that 
condition as an arrest of development, we should say that a Dondo denotes a com¬ 
plete arrest from the earliest development before birth, and that the various forms 
of skewbald Cornaz and complete Cornaz mark arrested attempts at pigmentation 
development; this is not out of accord with what we know of the microscopic 
examination of the Cornaz hair, to be discussed later. If we put therefore on one 
side the white “ points ” and the skewbaldism of the Cornaz, or attribute them, 
Mendelian fashion, to two factors independent of albinism as they occur in coloured 
dogs as well as albinos, we are left with two forms of albinism—the Dondo and the 

pp. 49a—503, Vol. vn. pp. 878—381) had already convinced Pearson, that the hypothesis of uniformity, 
i.e. absenoe of individuality, in the hereditary mechanism of any organism has a high degree of 
improbability. 
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Cornaz. Let us try and see whether it is feasible to elucidate the relationship of 
Comaz and Dondo on a two-factor hypothesis. Let us suppose white dominant ( D) 
and Cornaz recessive (R) *, then however much we should like to take Jack and Tong 
as (I)D)’b owing to their invariable production of Dondos, this is impossible, 
because in order that Jill should be white as she was, she must have been a (DR) and 
thus Jack and Jill could only give (DD)' s and (DR)'$ or no Cornaz spaniels. Jack 
and Jill must accordingly be treated as both heterozygotes or (Z>i£)’s, and thus it 
would be possible for them to produce Dondos and Cornaz spaniels. Jack and Tong's 
progeny were solely Dondos, hence it seems reasonable to suppose that Tong was a 
(DD). Donald dhu although a black dog had a factor for albinism due to his father 
Ian ban , the nature of this factor it is difficult to determine, but if it contained 
Cornaz it is surprising that all the offspring of his daughter Mor Man with Jack 
were Dondos. In fact of all the albinos—and there are many born from the offspring 
of Donald dhu —not one was a Cornaz except those resulting from the mating of 
Send dhu with a Cornaz Wang . Here we seem to meet with a result which needs 
much explanation. For while we have assumed Cornaz to be recessive to white, it 
appears dominant to black, for all the offspring of this mating, Trine , Hans and 
Grethel , are not only Cornaz in appearance but appear to be homozygotic Cornazs 
(RR) for their litters arc all Cornazs. If as seems probable by the nature of their 
offspring, Wee Tong , Ian ban , Mor bhan and Meg bhan were all homozygous Dondos, 
then the mating of Hans and Meg bhan should have given nothing but Dondos, 
whereas it gave two Cornazs as well as Dondos like Ben and Mendel. Further, if 
Dondo was dominant to Cornaz, then the result of the Hun and Liese matiug may 
be looked on by some with suspicion. If we treat Meg bhan as heterozygous, then 
Ben and Bube are possible theoretically as brothers, but Ben must also be hetero- 
zygous. Now Juby must, being Comaz, be a homozygous recessive ; hence a second 
factor for Cornaz must have been handed down to Ghangpie , and so we are ultimately 
again forced to consider that Ian ban or Wee Tong had a factor for Cornaz. We 
are led to suggest, indeed, that all the Dondos are heterozygous—a not uncommon 
suggestion to make, when we find as in the inheritance of anomalies or diseases in 
man, that it is impossible to interpret the pedigree, if we suppose any individual 
to carry only dominant factors. 

We now turn to some of the descendants of Donald dhu , who while he was de¬ 
scended from Jack r, had no connection except through Jack with Wang. He was 
chiefly mated with his own daughters Cilis dhu , Mor dhu and Send dhu out of Wee 
Tong. Cilis dhu was a black bitch with white shirt front. Donald dhu was mated with 
Cilis; and in the first litter (1913) there wore four offspring, Loki and Thor black 
dogs, an unnamed red dog and Freyja a white albino bitch, the hairs of whose coat 
showed no diffused pigment and no granules, though one or two of the more creamy 
hairs showed a very faint brown diffused pigment. She was of the usual dondo type. 
The second litter (February 1914) contained only two puppies, Odin a fine brindle 
sable dog and Frigg. We described this puppy at three months as having a white 

* The inverse hypothesis that Cornaz is dominant will clearly not suffice, for then Jack and Jill 
would have to be pure recessives, which will dearly not work. 
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coat slightly tinted with cream, but “not a Cornaz spaniel, mere ordinary albino.” 
But the bitch underwent a complete change when she lost her puppy-coat. On 
October 30, 1914, we described her as nearly whole colour, bruised chocolate, a real 
Cornaz Spaniel, very good coat with long hair and underhung jaw like her father 
Donald dhu. She grew very fat, failed to be mated and died March 1915; the post* 
mortem showed the intestines almost choked with fat. Her skin, preserved, is that 
of a very dark Cornaz spaniel; she had the usual albinotic eyes. This is the only 
case we know in which a dog, born apparently a Dondo, has afterwards developed a 
Cornaz coat, although the reverse transition has occurred more frequently *. In her 
third litter sired by Donald dhu , Cilis produced only coloured dogs, five in number, 
three sable and two liver coloured, and into their description we need not enter at 
present. Cilis in 1915 was mated with Wang and produced only three puppies, one 
Bran a black dog with white shirt front and white left-hand fore-paw,and two puppies 
which died on the day after birth, one a fawn and the other an albino, most probably 
Cornaz. Lastly, Cilis had a fifth litter of three by Donald dhu in 1916, all of whom 
she overlaid ; it consisted of two black puppies, and a whole white albino bitch. It is 
thus clear that a black bitch mated with a black dog can produce albinos of both 
types. Donald dhu was again mated in 1913 with his aunt Wee Tong , the pure-bred 
Pekinese dondo. In the litter of three all were albinos; namely Dargo , a Cornaz 
Spaniel with white shirt front and paws, the rest of the body uniform lilac, 8ora t a 
dog of perfectly whole Cornaz coat, and no sign of white markings, and Minona f a 
white albino bitch with slight creamy touches. Dargo , Sora and Minona had all 
the usual albinotic eyes. We see therefore that a white albino or Dondo crossed 
with a black dog can give either a Dondo or a Cornaz albino. Further, of the 14 
offspring of two Pompeks, Cilis and Donald dhu , four were black with white shirt 
fronts like their parents; five had sable coats, one was liver, one light red and three 
were albinos, namely two Dondos and one Cornaz. 

Crossed with his daughter Beni dhu } Donald dhu had in 1914 four offspring: 
Teufel a black dog with a small white shirt front, two black dogs with white 
markings, and a light red brindle bitch with white shirt front and white paws. Thus 
in this litter there was not a single albino. We cannot, however, assume Beni to 
have had no factor for albinism, because crossed with Wang the Cornaz albino, 
she had in 1913 a litter of three: the bitch Trine , a Cornaz, and a Cornaz bitch 
and dog, who died quite young. Crossed again with Wang in 1915, she had a litter 
of five, one bitch and four dogs. The bitch was jet black with a white shirt front, 
the four males were three of them Cornaz and the fourth was a dark brindle with 
white front. 

The third black-coated sister of Cilis and Beni, namely Mor dhu , declined to be 
mated until she was three years old. She was then crossed with her sire Donald 
dhu and produced in 1915 a litter of three: Ola, a black dog with white front, Tolli, 
a similarly coated dog, and Siri , a red brindled bitch (see Pedigree I). In early 1916, 
Mor dhu gave two further puppies to Donald dhu , Hille a black bitch with a white 

* There exist several more or less well authenticated reports of human albinos acquiring some 
pigmentation. Frigg 9 a case may throw light on these. 
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shirt front, and a second jet black* bitch with a like front. In late 1916 and in 1917 
Mor dhu had single puppy litters; in the former year’s late litter a jet black bitch 
with white shirt front, and in the latter year a dog of jet black coat with a similar 
shirt front. The infertility of Mor dim— whother due to the difficulty of procuring a 
good meat diet for dogs during the war, or, as we suspect, to some personal anomaly f— 
was so great, that in the course of five years she had only seven puppies. The whole 
of these were of the usual Pompek type except Siri , who was a red brindle. The 
existence of Sin showed that she was not a dominant black, but could like her 
sisters Gilis and Sene produce red puppies. It would have been interesting to know 
what would have happened had she been mated with Wang, but her fertility was 
exhausted before this could take place. She may have carried no factor for albinism, 
but of this we cannot be sure. The only white hairs she carried formed a small 
patch on her belly, invisible as she ran about, so that she looked like a black pure¬ 
bred Pekinese. Her absence of white may possibly be correlated with the fact that 
she bred to Donald dim , her father, far more black dogs, than her sisters Gilis dhu 
and Send dhu did. The actual distribution of colours of offspring is shown in the 
accompanying Table. 


Mated with Wang (Cornaz) 

Name of 
Bitch 

Mated with Donald dhu (Pompek) 

Total 

Cornaz 

White 

Albino 

Rod 

Black 

Black 

Red 

“Sable” 

White 

Albino 

Cornaz 

Total 






Mor dhu 

0 

1 




7 

3 

1 

_ 

1 

1 

Cilix dhu 

4 

2 

• r >t 

2 

1 

14 

12 

8 

2 

1 

1 

Sene dhu 

3 

1 


— 

— 

4 


X Brown with more reddish tinge than is to be found in the natural fur of the sable, but distinguish¬ 
able from dark red. Not unlike, but far from wholly like, the coat of a brown retriever. 


It is clear from this table that the Pompek Donald dhu and his two daughters 
Gilis dhu and Send dhu all carried a factor for albinism. It is less certain about 
Mor dhu , but we might also have supposed her sister Sent dhu to carry no such 
factor, had she not been mated with Wang. Theoretically, Donald dhu, being the F x 
generation from albino Pekinese ( RR) and black Pomeranian ( DD), should have 
a formula of the type (DR), so that when crossed again with the pure albino (RR), 
his offspring should be (DR) or (RR). Thus, if D signifies “some” colour, this would 
cover red as well as black, and there would be nothing anomalous in the observed 
offspring of Donald dhu and an albino Pekinese. According to this theoretical view 
all Donald dhis daughters must also have the formula (DR) and mated with their 
father should have given 75 °/ 0 coloured and 25 °/ 0 albinotic offspring, the observed 
numbers were 22 to 3. Again these daughters crossed by Wang (RR) should give 
50 °/o albinotic dogs, the actual numbers were 4 to 11. The odds are about 14 to l 

* We use the term ‘‘jet” to distinguish the pure blaek ooat from the rusty black whioh is not un¬ 
common with Pompeks. 

f She was a most unsatisfactory mother, paying no regard whatever to her offspring. 
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against suoh an excess of coloured puppies in the first, and 27 to 1 against 
such an excess of albinos in the second form of crossing. There should be on the 
simple Mendelian theory applied no interdependence of the two sets of results, and 
accordingly the odds against the combined result following such a simple Mendelian 
theory are very considerable, i.e. 378 to 1; and these odds would still be considerable, 
i.e. 94 to 1, if instead of taking the excesses of coloured and albinotic puppies, we 
took for calculating our odds deviations whether in excess or defect. 

It may be asked why we should endeavour to apply any such simple Mendelian 
theory. The answer is that albinism is still cited in elementary (and not only ele¬ 
mentary) textbooks of genetics as an illustration of this simple form of Mendelian. 
theoiy. 

In order to meet a difficulty in mice very similar to what we have found in 
dogs, Bateson in 1903* introduced a somewhat different notation and definitions. 
He speaks of an albino (pink-eyed) gamete 0, this we suppose corresponds to our 
white albino with pink eyes, and O' which is described as a colour-bearing pink¬ 
eyed gamete. Then according to Bateson the homozygotes GO and O'O' will all 
have pink eyes, but the former will have white coats, and the latter some colour in 
the coat. The heterozygotes (00') will, according to Bateson, have dark eyes and 
some colour in the coats. In other words, it would appear as if some colour in the 
coats dominated the eye colour in the heterozygotes (0'G) ) but not the eye colour 
in the homozygotes ( G'O') when these are crossed. This as Weldon pointed out at 
the time is a somewhat curious interpretation of dominance; it also leaves very 
vague what is meant by “some colour” in the coat. The Comaz spiniel no more 
than the white Pekinese albino has any pigment granules in its coat; the "some 
colour” is diffused pigment, and whether this pigment is creamy yellow as it occurs 
in some white albinos at certain seasons, or grey as in the Comaz, there are no 
granules. Pigment granules do occur in the eyes which Bateson calls "pink” of both 
dogs and men. It is clear that our Comaz albinos must be homozygotes (O'O'), for if 
they were heterozygotes (G'O) they would not have "pink” eyes. Now if the reader 
will look at the lowest generation on Pedigree I he will see a mating between a white 
albino ("pink-eyed”) (00) Wu, and a "pink-eyed” Comaz with "some colour in 
the coat” (O'O') Jvby . The result was three offspring with heavily pigmented eyes 
and coats with plenty of pigment granules in the hairs (GO'), according to Bateson’s 
theory. Thus this remarkable result in dogs (and mice) is in accord with Bateson’s 
theory. But alas! the theory fails to accord at many other points with observation. 
According to the theory every Cornaz must be a homozygote and Cornaz x Comaz 
give only Cornaz, but Comaz x White Albino should give dark-eyed offspring with 
some colour in the coat. Let us look at some of the results: Hans and Orethel were 
very typical Comazs, they produced in 1915 three definite Cornazs; Trine and Bvbe 
again were quite typical Cornazs, they produced in 1914 two Cornazs—these results 
are all in accordance with Bateson’s view. But Hans , a Comaz, mated with a white 

* Nature, Vol. lxvii. p. 462. For the controversy which followed, see p. 612 (Weldon), p. 685 
(Bateson), p. 610 (Weldon), and Vol. lxvui. p. 88 (Bateson and Weldon). 
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albino Meg bhan, i.c. ( O'O' ) x (00), gave in 1915 Amie and Mendel , two white albinos 
(GO) and Bube and another dog, two Cornazs, one with a white shirt front and the 
other with a white belly. All four should have had coloured coats and pigmented eyes! 
In 1915 Hans (O'O') x Wee Tong (00) produced Hun a white albino (00) and Hans 
mated with Meg bhan (00) in 1920 a typical white albino Ben. Further Hun , a white 
albino mated with Liese , a Cornaz bitch gave rise to a litter of two Cornaz bitches, 
Hinne and Henne , with white shirt fronts instead of dogs with coloured eyes. It 
is clear therefore that Bateson’s [(?, O'] hypothesis will not suffice to describe the 
relations of Cornazs and white albinos in dogs; yet our Cornaz spaniels seem to 
accord closely with Weldon s pink-eyed “lilac” mice. Bateson wrote* of his [(7, O'] 
hypothesis in 1903 that: “Anyone conversant with Mendelian phenomena can now 
predict the eye colour of the future offspring of the various unions with approximate 
accuracy.” Discarding the [ 0 , O'] hypothesis as unworkable, it would be of interest 
to have a prediction as to the nature of the offspring which Oelberta may produce 
when mated with Schwarzert or Brunert. 

It will be seen that even on Batesons hypothesis the inheritance of pigment in 
the coat and of pigment in the eye arc not independent characters; nor are diffused 
pigment and granular pigment independent of each other. Two animals with 
albinotic eyes can give rise to offspring with deeply pigmented eyes, and this 
possibility depends on there being diffused pigment in the coat of one of them. We 
have never got a pigmented eye in the offspring of two Dondos or white albinos, 
and only as the reader will perceive very rarely from a Cornaz albino and a white 
albino. But as it. appears possible to obtain a Cornaz from two white albinos, and 
from a Cornaz and a white albino offspring with heavily pigmented coat and eyes, it 
would seem within the range of possibility to obtain pigmented offspring from a race 
of white albinos, at any rate in dogs. We say a possibility, because when this has 
come about in our breeding, both the white albino Wu and the Cornaz piebald Juby 
were extracted albinos, i.e, their parents in both cases were coloured dogs. It is 
conceivable in this matter that an extracted albino, especially when the ancestry 
contains a number of dogs with heavily pigmented coats, may not have the same 
germinal constitution as pure-bred albinos. 

To obtain some idea of the exact relation of eye pigment to hair pigment, 
attempts were made to obtain dogs with white albinotic coats and pigmented 
eyes. Various attempts were made to attain this end, but Pearson personally was un¬ 
successful in any experimental mating directed to this end. Only one mating, that of 
two red dogs, non-litter brother and sister from the same parents, gave, as it were 
by chance, the desired result. Topsy, a light fawn bitch with black points, crossed by 
the white albino Ben gave birth in separate litters to the red bitch Setie and the red 
dog Eld . There seemed nothing remarkable about this pair, but they were mated 
together to see whether a dominant red dog had been by any chance reached. All 
that was expected from the mating was the usual mixture of red dogs and white 
albinos with possibly a fawn. Judge of our surprise, when what we had failed to 
procure of purpose arrived indirectly. The record of the matings is the following: 

* Nature , Vol. Lxvn. p. 462. 
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In the five litters sired by Eld (1924, 1926,1927, 1928 and 1929) Setie has 
given birth to 16 puppies: 


White Albinos 

White coats 
and dark eyes 

Bed brindles 

Fawns and 
Fawn brindles 

Brown brindle 

2 

3 

4 

5 

1 


This is another illustration that one cannot take albinism and colour as alle¬ 
lomorphic. The three dogs with dark eyes and “white” coats were all males; as 
puppies they had coats as white as the whitest albinos, but on losing the puppy-coat 
they became creamy or yellowish in patches. The possibility of perpetuating a 
breed of white-coated dogs with black eyes could not be attempted, because all 
three were males, and Busdubh and Mike II died when about a year and about 
four months old respectively, in both cases suddenly. Tweedledee is still alive. 
Setie, however, was mated in early 1927 with her own son Nick , but with no success 
as far as white-coated, dark-eyed offspring go; she had one brindle fawn dog, and 
besides Alpha and Gamma , dogs, and Beta, a bitch, all three brown red with black 
points and white shirt fronts. Further, mated with her uncle Changpie, a grey 
brindle, in 1925, she had two puppies, one a grey brindle dog Seeta , and the other a red 
brindle bitch. To test whether she had a factor for albinism she was mated in early 
1926 with her double cousin the white albino Mac, the result was a litter of three, 
a white albino Wu , a Cornaz albino and a fawn with black points, Feng , all dogs. 
Omitting her mating with Mac as an albino, Setie, a coloured dog carring a factor 
for albinism, has been mated with three coloured dogs Eld, Nick and Changpie, 
all carrying a factor for albinism. She has had by them 21 puppies, of which two 
only were albinos, the odds here are about 12 to 1, or only in about 13 trials would 
the result once fall so far short of a Mendelian quarter. These odds are not very great, 
but they are in the same direction as those previously reached for the mating of 
two coloured dogs each carrying a factor for albinism, and confirm the view that 
albinism in dogs is not a simple Mendelian recessive, i.e. the white albinotic coat 
is not recessive to a coat with “some colour in the hair.” 

We may remark that the hair of the white coats of our three dark-eyed dogs does 
not seem to differ in any character from that of very creamy white albinos. 

We now turn to the dogs bred by C. H. Usher in Aberdeen. While they 
started from the same foundation stock as those bred by Pearson, their aim was 
somewhat different. Much more Pom blood was introduced through the two black 
Pom bitches Olga and Dido and the black Pom sires Prince and Drum Chief, but 
also through the white Pom Lady used with a view of getting ultimately a Pompek 
with pure white coat and black eyes. 

A study of Pedigree II indicates that Usher’s stock of Albino Pekinese takes its 
origin in Ah Cum . Mairi bhan and Ian ban, non-litter brother and sister, were off- 
spring of Jack and Tong . Spook, a bitch that had the whitest coat of all our dondos 
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and was the worst of all our mothers, few of her offspring surviving, was the grand¬ 
daughter of Tong by Mia of Alderbourne, through coloured parents. Beenie, a Dondo 
bitch, was a granddaughter of Wee Wong through coloured parents. Wee Wong was 
a red dog, litter brother to Jack and Jill. Wee Wong must have had a factor for 
Cornaz albinism, like his nephew Wang , because Beenie gave rise to Comaz albinos. 
A noteworthy difference will be found between Pedigree I and Pedigree II. While 
there are quite a considerable number of red and fawn dogs in Pedigree I, only 
about 2 °/ 0 of the dogs bred by Usher appearing in this pedigree are red or fawn. 
Of course, neither of the pedigrees now published contains anything like the whole 
number of dogs bred by either Usher or Pearson, the lines have been selected to 
illustrate the manner in which albinos occur; nevertheless the fact remains that 
Usher, breeding in more Pom blood and using more his F x generation of Pompeks, 
failed to obtain the variety of coloured dogs appearing in the London stud. In the 
present Pedigree II the coloured dogs are, first, the bitch Sheila , light brown coat 
of various intensities of brown, called by Usher shades of sable, and with brown 
hairs on vertebral column tipped with very dark black, so that she seems to have 
black “curls” down her back; secondly, a dark red brindle puppy, the offspring of 
Sim and Caristiona , and lastly among the somewhat weird offspring of Sim and 
Lady , a skewbald fawn and white dog, and a white dog with fawn markings; these 
are all. The other coloured dogs in this pedigree are black with white markings, i.e. 
Pompeks. Such dogs have almost disappeared from Pearsons stud*, presumably 
as a result of emphasising the Pekinese element. 

Ushers results from the offspring of Jack and Tong were at first remarkably in 
accordance with the simple Mendelian theory. Two white albinos interbred gave 
rise to white albinos Mam bhan and Ian ban . These interbred gave rise to a litter 
of six albinos, all white. Ian ban, mated with the pure bred black Pomeranian 
bitches Olga and Dido , sired seven Pompeks, black dogs with some white markings. 
Dido mated with Donald ban, also a white albino Pekinese, produced five typical 
Pompeks all black coated with white shirt fronts. Simple Mendelistn seemed to go 
well, our “dominants” black Pomeranians bred true, our recessives white albino 
Pekinese bred true and the F 1 generation had most of their coat black, if they added 
indeed the white shirt front unknown to the dominant Pomeranians. Only two ex¬ 
ceptions have occurred to this rule, the black Pomeranian Princel, gotout of the white 
albino Beenie two dogs; Coinneach ruadh {Kenneth) whose coat was chocolate, and a 
second unnamed dog with like coat; the remaining three dogs of the litter were the 
usual Pompeks. Kenneth failed to breed and the matter could not be carried further. 
So far, so good. Before we turn to the F 2 generation, we may note that Usher 
mated his white albino Donald ban with two other white albino bitches. Spook was 
descended from our albino Tong , and Beenie from Wee Wong , the brother of Jack 
and Jill ; both Spook and Beenie traced their descent through coloured dogs. 
Donald ban and Beenie mated provided four dogs all Cornaz, not a single white 
albino! Donald ban and Spook mated resulted in eleven Cornaz and only two 

* I believe the only black dogs now living bred by me are Jet, born in 1922, and Schwarzert , bom 
in 1928, both extracted blacks and differing in many respects from Pompeks. K.P. 
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white albinos. It does not seem possible to suppose Cornaz albinism an allelomorph 
to white albinism 1 The white albino Beenie crossed with the Pompek Dugald dhu 
should have produced 50% Pompeks and 50% white albinos; actually she had three 
black puppies with white markings and three Cornaz, not white, albinos. Crossing 
members of the F% generation we find a new series of difficulties arise, if we 
endeavour to apply any simple Mendelian formula. A not unreasonable or impossible 
result was obtained when Buna dhu was mated with Donald dhu, the litter consisted 
of five black dogs, all with white shirt fronts; if we suppose that these were all Pom¬ 
peks, or heterozygotic in character, for the dominants were whole black, the odds 
against this occurrence would be 31 to 1, and it might of course occur. The two 
matings of the Pompeks Ian dhu with Yfa dhu cause more difficulty, for the al¬ 
binism of both only arises from the Jack x Tong stock, yet we find in their progeny 
one white albino, one Cornaz albino, a Pompek and the light brown (sable) bitch 
Simla, with long black tips to the hairs on top of her saddle. Clearly the concep¬ 
tion of black allelomorphic to white albino is not adequate and colour factors for 
reds, red brindles, etc., unknown to the Pomeranian, come in through the albino 
Pekinese. Turning now to the matings of albino Pekinese with Pompek, we have 
already referred to the litter of Dugald dhu and Beenie ; in a litter of Dugald ban 
and Mairi dhu there were three Pompeks and a white albino Qarthonzie bhan. The 
latter mated in 1913 with Wai (also a white albino out of Spook by Donald ban, 
both pure Pekinese white albinos), produced three Cornaz dogs and a white albino 
bitch Ordchoinachie , thus confirming the view that white albinos can give rise to 
Cornaz albinos. If we suppose Cornaz albinism can arise through the dilution of 
albinism by the Pomeranian black, then Oarthomie*8 descent from Mairi dhu 
and Olga , the latters Pomeranian mother, can assist the hypothesis. On the other 
hand we have to note that Spook and Donald ban , both pure white Pekinese albinos, 
obtained Cornaz offspring without any dilution by Pomeranian blood. Ian ban and 
Garthonzie bhan gave in three litters six Cornaz and not a single white albino. 
Spook mated with Prince I, a black Pomeranian, gave rise to a litter of five Pom¬ 
peks, also black with white shirt fronts. One of these Anna dhu , mated with a pure 
Pekinese white albino Donald ban, had a litter of two Pompeks and one Cornaz. 
Thus Usher’s albinos tended during the war years, much like Pearson’s, to be Cor¬ 
naz rather than pure white, and both were subject to a very heavy death-rate, 
apparently greater in the case of the Cornaz than in that of the white albino. 

We may now stay to consider an attempt to obtain directly pigmented eyes 
with a Pekinese habit. In 1915 Usher mated a pure white Pomeranian with 
a Pompek of the F x generation, Fraoch dhu , see Pedigree II. The litter was note¬ 
worthy; all members had dark eyes. Sim was a piebald black and white with white 
hair; Caristiona , of the usual Pompek type; the third puppy,a dog, was also aPompek, 
mostly black with thin white stripe on throat, chest and belly; the last puppy, Mor 
bhan II, was white with slight traces of cream, she had on the left ear brown hairs 
with some black, and across the rump a pale grey band. Caristiona mated with her 
brother Sim gave birth to a dark red brindle dog, a Cornaz and a Pompek, black 
with white forepaws. Thus Sim and Caristiona both contained a factor for albinism 
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of the eye, but the normal dark eye dominated when Lady was mated with either 
a hybrid or a pure albino. Sim was mated with his mother Lady in the war years 
1017 and 1918, and this bitch had puppies: Tam II, a skewbald white and fawn dog, 
perhaps the use of the term skewbald is too definite—he had a white coat with light 
fawn patches, not very clearly marked off from the white, fawn-coloured ears, and 
a grey right front paw; two dogs and a bitch had white coats and pigmented eyes; 
a bitch Tibbie was all white except for fawn-coloured ears; Neil was a piebald like 
his father Sim, while Oscar and Marcus were all white, with brown noses. All the 
eyes of the eight puppies were dark. 

Finally in 1919 Lady was covered by Donnchadh ban, a Cornaz albino, and she 
had three white puppies, two bitches and one dog, all of which died in the first 
fortnight. Their skins have not all the same degree of whiteness, one being almost 
Cornaz colour. Ushers descriptions, macroscopic and microscopic of the eyes, seem 
to indicate that they all would have had “dark” eyes*. Lady had dark brown irides, 
but they were not examined after death, though those of the black Pomeranian 
bitches Dido and Olga were. It is difficult to compare a puppy’s eye with that of a 
fully grown dog, but it is just possible that the albinism of the father Donnchadh 
ban may have had some influence on the pigmentation of the offspring. Still the 
evidence seems in favour of terming these dogs “ white and dark eyed.” Unfortunately 
Lady was never mated with a pure white albino Pekinese, and her offspring with the 
Cornaz albino did not survive. The experiments with Lady were carried on during 
the war years 1915—1919, during a part of which time Usher was in Salonika, and 
they had to be abruptly terminated. They left the problem of whether it would be 
feasible to obtain a pure white Pekinese coat and skeletal form combined with a dark 
eye unsettled. Pearsons Busdubh, Mike and Tweedledee are nearer to the conditions 
of skeletal form, but the coat is like that of the white Pekinese in hair colour, and 
tends as the puppies grow older to show yellowish patches, it is not pure white, 
and this is exactly the trouble with pure-bred “white” Pekinese with dark eyes. 


* No. 1. Died at three days. Eyes closed. Both eyeballs translucent, iris blue grey. In opened 
eyeball iris and ciliary body black, fundus very pale. Microscopical Examination: Itetinal epithelium 
well pigmented, stroma of iris and ciliary body contains numerous faintly pigmented ohromatophores, 
which have a pale brown colour. Choroid in some parts shows no pigmentation, in other .parts some 
of the cells are distinctly pigmented. 

No. 2. Died at thirteen days. Eyes only partly open. Iris dark blue; deep red translueency of 
eyeball, when held up towards the light. Eyes frozen and opened; posterior Burfaoe of iriB dark brown; 
ciliary body black; fundus pale brown. Microscopical Examination : Betinal epithelial layers on back 
of iris, the pigment layer of oiliary body and the hexagonal oells of retina darkly pigmented. In iris 
stroma are numerous lightly pigmented ohromatophores, which do not appear to have received their 
full complement of pigment granules; lightly pigmented ohromatophores in smaller numbers are present 
in the ciliary body and ohoroid, also a few at sclero-oorneal margin. 

No. 3. Dark iris and fundus. Eyes frozen and opened; iris, oiliary body and fundus blaok. Micro¬ 
scopical Examination: Darkly pigmented epithelial layers of retina, pigment layer of oiliary body and 
hexagonal layer of retina; stroma of iris and ciliary body and ohoroid are darkly pigmented, muoh 
darker than in No. 2. 

A piece of skin from the head, plaoed in 10 °/ 0 formalin, embedded, out and examined microscopically, 
showed no pigment. 
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As the reader has been previously warned the present paper does not deal at 
length with the 500 and more dogs which have been bred. It endeavours solely to 
illustrate the types of albinism occurring and their relations to each other. Even thus 
many problems arise, which cannot at present be solved. Perhaps one of the most 
important of these is the question whether all Cornaz albinos are equivalent. We 
do not think the amount of cream in the coat of a white Pekinese albino has the 
slightest gametic importance; it varies so with the season of the year, and the 
interval since change of coat. On the other hand the intensity of diffused pigment 
in the coat of the Cornaz albino is very considerable and if a number of skins of 
these dogs be taken a fairly continuous scale can be reached running from a dog 
like Wang up to Frigg . Some might indeed link Wang on to the creamy white 
albinos, and suggest that there is only one kind of albinism, and that it varies in 
intensity. The skewbald Cornaz and the Cornaz with white shirt front give, however, 
a contrast which cannot be put on one side; there is an essential difference between 
the two areas. Further, our breeding experiments seem to indicate that: (i) two 
dondos, or white albino dogs, never give rise to dogs with dark eyes, but (ii) they 
can give rise to Cornaz albinos. Again (iii) we have never known a Cornaz mated 
with a Cornaz produce puppies with dark eyes, they have only albino puppies, 
white or Cornaz; but (iv) we have known a case in Vfcich a Cornaz skewbald, crossed 
by a white albino, produced dogs with pigmented coats and dark eyes. 

Our “Cornaz” dog seems exactly in accordance with what Weldon in his mice- 
breeding experiments (see Biometi'ika, Vol. XI. Appendix, “ Records of Mice-Breeding 
Experiments”) termed pale-blue-grey (“lilac”) and represented by the letter/. 
These mice had always pink eyes ( p ), and might be either whole-coloured 
(Weldons 6), or mixed white and grey in patches of different sizes (Weldon’s 

1 to 5), i.e. like our skewbalds and our Comazs with white shirt front or white 
markings. Weldon made a very large number of matings of pink-eyed lilac mice, 
his mice being, like our Cornaz albinos, extracted albinos. In all cases ( p,f ) x ( p,f ) 
bred true, and this whatever the hybrid generation they appeared in. He also bred 
white albino mice with the pink-eyed lilac mice. From these crosses he obtained 
as a rule mice with dark eyes and colour in the coat, even black (matings 2 H. 108, 

2 H. 156, see loc. cit. pp. 20 and 22) and chocolate, but they might be also fawn- 
yellow or wild colour. Occasionally, however, he obtained from this cross pink-eyed, 
and not dark-eyed, mice with fawn or “lilac” coats (see mating 3 H. 31, loc . cit p. 25). 
All this corresponds to what we have found for dogs, only the cross Cornaz albino 
with white albino seems less frequently than in mice to give coloured coats and 
dark eyes, and more often pink-eyed offspring. Of course the explanation may be 
that our pure white albinos had a latent Cornaz factor, or at least that some 
of them, like Jill , had. 

In this respect, perhaps, our Cornaz albino dogs corresponded to some extent 
more to Weldon’s Japanese waltzing mice; theso mice were skewbald, fawn and 
white, with albinotic eyes. The hairs of seven such mice have been examined, 
and in five of them the fawn hairs showed only diffused pigment and no pigment 
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granules. In two cases, however, the coloured patches showed to the naked eye 
a dusky hue and examined under a lens the patches were seen to consist of two 
kinds of hairs, the usual fawn and a darker kind. The former microscopically 
examined showed no pigment granules, the latter contained a certain number 
of pigment granules. The Japanese Waltzers bred true to their fawn patch and 
albinism, but when crossed with white albino mice, the offspring showed in part 
dark eyes and in part pink eyes, the former by far the most frequent. The pink 
eyes appear to have arisen when the Waltzer was mated by an extracted albino, 
i.e. by an albino very possibly having Waltzer blood in its ancestry. If we con¬ 
sider the waltzing albino mice really very similar in constitution to the lilac 
albino mice, i.e. when mating together as giving Waltzer albinos, and when mated 
with white albino mice as usually giving dark-eyed mice with colour in the coat, 
but pink-eyed mice if the white albino were an " extracted ” one, possibly having 
a factor for the albinism of the Japanese Waltzer, we can perhaps see some light 
on the problem of why the Cornaz crossed with the white albino does not always 
give a coloured dog with dark eyes. The white albino dog may carry a factor for 
Cornaz, and when mated with a Cornaz give an albino litter instead of one of 
coloured dogs with dark eyes. 

Our results indicate that there is considerable correspondence between the 
results for dogs and mice with regard to albinism. The correspondence is not 
complete—the lilac albino mouse crossed with the white albino mouse far more 
often produces coloured mice with dark eyes than the Cornaz albino dog crossed 
with the white albino produces coloured dogs with dark eyes—still the strange 
phenomenon exists in both dogs and mice. Albinotic eyes in both parents and an 
absence of granular pigment in the coats of both are not sufficient to indicate that 
the offspring may not have dark eyes and pigment granules in the coat hairs. 
A further point of difference between albinotic dogs and albinotic mice seems to 
lie in this: the mating of albinotic white mice, as far as my experience goes, 
always leads to albinotic white mice. The mating of two pure-bred Dondos, white 
Pekinese albinos, may on the other hand lead to Cornaz Pekinese albinos, i.e. to 
dogs with as markedly albinotic eyes as their parents, but with pale buff or light- 
grey patches of hair on the coat. As a rule, but perhaps not invariably, these 
patches contain only diffused pigment. These Cornaz albino dogs correspond closely 
to Weldon's “lilac” albino mice, or indeed to Japanese waltzing albinos, and when 
crossed with pure white albinos can give dogs with dark eyes and coloured coat. 
Thus a descent from pure white albino dogs to dogs with coloured coat and dark 
eyes seems feasible. On the other hand, the production of a “Cornaz albino mouse” 
—a fawn Japanese Waltzer or a “lilac ” albino—from albinotic white mice has not 
been recorded. One point may be noted here: the white albino dog—and the 
eyes of many have now been examined—has, like man, eyes which exhibit all 
the usual albinotic characteristics, the marked red reflex, the photophobia, and the 
defective sight, but, like adult man's albinotic eye, in all the cases yet examined, it is 
not absolutely free from pigment. On the other hand, the albinotic white mouse has 
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(iii) Pure-bred Pekinese albino Patty . 
Dondos, or pure-bred Pekinese albinos. 
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Plate II 
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Fo, showing contrast of white paws with pale buff body colour. 



(iii) Fe, skewbald Cornaz and Fo, Cornaz albino as puppies. 
Cornaz albinos from pure-bred Pekinese white albinos. 
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Plate III 



Fe, skewbald Cornaz albino, aged 5£ months, white and buff, bred by E. Nettleship. 
From pure-bred Pekinese albino (Dondo) Jill x pure-bred Pekinese albino 
(Dondo) Jack. 
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Plate IV 



(i) Pure white albinos, young dogs. 



(ii) Pure white albino puppies. 

Litters from extracted albino v pure Pekinese albino. 
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Plate V 



(i) Cornaz albinos, Hans and Grethel , showing grey (lilac) body colour 
and white forepaws and shirt fronts. 



(ii) Extracted Cornaz albinos, product of the mating of a pure bred Pekinese 
Cornaz (Wang) with a black bitch (from white Pekinese albino x black 
Pompek, the latter a product of white Pekinese albino x blaok Pomeranian). 
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Skin of piebald puppy from Dondo * Pompek. Eyes and hair Skin of skewbald Cornaz puppy with albinotio eyes, pale buff patches 

with many pigment granules. w 't* 1 diff u8 ®d pigment and a few granules. 
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Plate VII 



(0 (“) 

Juby, skewbald Cornaz albino, coat white with grey patches; the face and saddle patches 
can be distinguished, also the pure white on left thigh in (ii). 




(iii) ( iv ) 

Wu, pure white extracted albino. Both Juby and Wu have very marked red reflexes, and pink 

muoous membrane of nose. 
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an eye which stands almost unique among albinotic eyes of animals, in that it has 
been impossible hitherto to find traces of granular pigment associated with it*. 
Japanese albino mice and “lilac” mice have a small amount of pigment in the eyef. 
It would seem therefore that the pure white-coated albino mouse is more differan- 
tiated from the Comaz albinotic mouse (such as the Japanese or lilac albinotic 
mouse) than the pure-bred white albino Pekinese, the Dondo, is from the Cornaz albino 
Pekinese, both the latter having some amount, if small, of pigment in the eye. 

This may possibly account for the transition from pure albinism (no pigment 
granules anywhere) to normal pigmentation (granules in skin, hair and eye) being 
more feasible in the dog than in the mouse. 

* See Pearson, Nettleship and Usher: A Monograph on Albinism , p. 365, Case VI and p. 877, Case IX. 
f hoc, cit . p. 864, Case V and pp. 875 et scq.. Cases IV—IX. 
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ON THE DISTRIBUTION OF THE FIRST PRODUCT 
MOMENT-COEFFICIENT, IN SAMPLES DRAWN FROM 
AN INDEFINITELY LARGE NORMAL POPULATION. 


Bv KARL PEARSON, Q. B. JEFFERY and ETHEL M. ELDERTON. 


Let the origin be taken at the mean of the bivariate population and let that 
population be defined by a t , <r 2 and p. Let the sample be defined by m lt m,, 2 j, 2 2 
and r. Then it is known that the distribution surface of rn 2 is independent of 
that for Si, and r, so that we need not consider m h m 2 further. The surface for 
Sj, S 2 and r is given by* 

n /S,* 2rpSA W 

z = Zo g ~ 2 (1V) V<r, a ' H* ) Sn-i V,«-2 (J _ (« - 4) .(j)_ 


where n is > 2 and r lies between -1 and +1. 

Now take 

v _ /sTyy) ^ _ /2 (i - ?*) 

"' 3 " ff| v n *• 

and consider the integral 

7 0 = A n f + ‘dr Tdr dye~'*>*» 1 ^(xy) n ~ 2 (1 -r 2 )*< n " 4 >, 

“ ,.l c pWL-^)j-. >(ij) . 


where 


i 


Change the variables of integration to u, x and y, where u = rxy, and we have 

/„ = A 0 P * due^Jje - (* ,+ ^ (a*y 1 - u*)*<’ 1 " 4 > xydxdy, 

the double integral being taken over the area above and to the right of the 
hyperbola xy—u\. 

Let F(u) be an even function of u defined for u > 0 by 

F(u)=[[ e -(**+y*) (*y - u »)i (n-4) xydxdy ' 

J J xy>u 

Transform to coordinates ^ = xy, 7 j = x i + y i , 

F (u).r r 

Making further substitutions, 77» cosh <f> and f =u cosh \fr, this becomes 
F (u) = u n ~* I [ e"^ u 00811 * 00811 ♦ sinh** 8 -dr cosh ^ d^ d< 4 . 

J if/ssQ J <$>&() r T T 


B. A. Fisher, Biometrika , Vol. x. p, 510. 
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No, the Bessel fcoofta of the second kind snd imaginary a^mont is gi,on by- 

K 0 (w )« le-vom+dA, w > 0 . 

Hence # 

F(u) = A 0 (2« cosh i/r) sinh" -8 i/r cosh 

Further we havef 

f 0 ( 2 k cosh ^r) sinh^ l yr cosh yrcty = _.( x + 1 )^x+ i( 2 k) ^ 

and thus F(u)^r (hn - 1)u Un-* )Kin i(2u * . (iy)> 

for u > 0 , while for m < 0, F(u) is defined by F(- «) = F(u). Hence 

/o“ir(^w- 1 )A 0 J* (e 2f,u + e~ 2 pu )u^ n ~ 1 F ^ n _ 1 ( 2 m) du ...(i v )««. 
Accordingly the curve of frequency of 2 u which is such that 


v = 2u = 


n 


Pu 


•(v) 


IS 


l-p*<Ti<r* . 

y 2i"- 1 ^ .(vi). 

To complete this curve we have to find A 0 . If N be the total number of samples, 

N= f +cc i/dv = laESi*-}} F e ^±m v \n-\ K . . 

)-* 2**- 1 Jo 2 K \n-l («)*» 


A o r(Jn- 1 ) f« , 

J 0 coah P^ 1 "'VlW* .(vii). 


, W1 ” F ° Ceed short, y t0 the determination of this integral, but we will take 

dktrltli°V 10r %f n /° rm ,. We reqUire the mo,nen f‘Coefficients of the 
distribution of ft,. If we obtain these coefficients about the fixed origin « n = 0 

IZ^l ^ "71 , ^ f ° rmUla t0 ° btain them when the ^ ^o calculated 

eoeffil- f kT * ^“P 168, K We are Seekin g the »*» moment- 

coefficient about zero, we must include the term (r^S,)" under the signs of the 

triple integration. In other words we need 

, / 2 «j<r 2 (l — p*)\ m r+« 

I m - ^- ~ - JLJ ) A J_ x e*> u u m F(u) du 

" A m f ”(&*+(- ire-**)u”>F(u)du .(viii), 

4 _ r* / 2o ‘i°‘» (1 - p*)\ m + n -i 

^ m <7 l- n - ) .(viii)*”. 

* O. N. Watson, Theory of Bessel Functions, p, 172 ( 5 ). 

+ Watson, loc. cit. p. 417 (6). The remit (iii) follows from putting In Watson’s formula an 

o=2, ,=«, t=U oosh^ and remembering that jr_H(2«)=jr,(2 u ); for which see Watson, p. 79. ^ 


where 
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To evaluate I m consider the relation* 
f X (2m) da =— \ 4r ~ ~ + {Vt - 

Jo MV ' 2 x+ *r(\ + §)(l-p») x+ 4 

x F(-/i + k> P + h *• + $> H 1 + p)) 
.(ix). 

This gives 


where 


7 _i a ,/- r (i w -l)r(n + m-l)r(OT + l) , . . 

Im ~ Um v " fm (p) ’ 


A(p) = (- 


1 + p\i(n-l) + m 


F(-i(n-3),i(n-l),t(n+l) + m> l(l+p)) 


+ (-!)'"( 2 )’’ ^(-i(«- 3 ),i(n-l).H» + l) + »«.i(l-p)) 

.(x). 

Considering the function f m (p) and noting thatf 

^ F(a, A 7 , i (1 + />)) = J “ J (#■(«. A 7 - 1, £ (1 + p)) ~ F(«> A 7. t (1 + /»))). 

I/(«. A 7. i (1 - ^ A 7 -1. i (1 - p)> - *•(«. A % Hi- p)». 


1 _ o\i(»- 1 )+»* 


we obtain by straightforward differentiation, 


Again putting ?/i = — 1, 

, , , /1+P\ 4( "' 3) : 


1 /72 — 1 


2 \ 2 


- + m 


)/,«-! (P) 


/-i(p)= ( 2 P ) *(-*<«-»>. i(n-l), H« + l). J(l+p)) 

- ( Vv r (-1 (»- 3), i (., - », i <> +1), j (i - 


using the Euler transformation, or 


hence, putting p = 1, 


/_i(p) = 0 .(xii), 

r (7 - a - ff) r ( 7 ) = 
r(7-a)f(7-/3) ^v«. A% 1). 


, /^_ r(i(» + i) + m)r(i(»-i)+«o 

/bW ‘ r (n + m — 1) r (m + 1) . v 1 


* Watson, toe. rit. p. 388 (7), with proper interchange of symbols, 
t Forsyth, A Treatise on Differential Equations , 1885, p. 195. 
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Putting wi = 0 in (xi), 




by (xii), thus^o (p) is independent of p, or putting p = 1, by (xiii), 

,,, r(i(n+i»rft(»-i)) 


i>-i)r(i) . 

Hence putting to « 1 in (xi) we have 

d i u\ ln+l f( n \ i»+ir(i(»+i»T(Kn-i)) 

2 2 /o(p) ~ 2 2 r(n-l) r(l) 

, *• , . . 1 r(i(n + 3))r(J(n-l)) , x 

and integrating f x (p) = ^ l)r(l) - - 9 .< xv >> 

the constant being clearly zero, for from the expression for f m (p), 

/i(0)-0. 

Again, from (xi), 

d ... l« + 3 ir(i(n + 3))r(i(»-l)) 

dp fAp)= s t~ * (p) 2*-i>=i)- p ’ 

or /* (p) = ~ r (t (» ~ 1)) j + const. 


To determine the constant, put p = 1 and use (xiii); we have 

r(i(» + 5)) r(I(n + 3))_ 1 r(|(n + 5)) T(j(»-!)) 
r(n+ 1)2.1 “8 'r(n-l) 

8 r(a — 1) n 


+ const. 


1 r(|(»+5))r(i(a-i)) 


/MV'z-g r(n-l) \ P + nJ . 

Now use (xi) again, putting m = 3, integrating and remembering that/ w (0) = 0 
when m is odd ; we have 

1 r (J (n+7)) r (* (« - 1 )) /p» p\ , xvift 

Mp)-Yq T(n — 1)-U + . (XVn) - 

Finally, from (xi) once more we have 

i r(i(«+9))r(i(«-i))/P 4 . . conDt 

A (p) - 32 -f>ri) Via + 2nJ + con8t - 

where to determine the constant we use (xiii) or 

r(l(»+9))r(j(« + 7)) i r(j(n+9))rq(»-i)) t» + 6 

r(n + 3) 4.3.2 32" T(n-l) 12n ^ ° n8 ''- 


Thus f t a \- 1 r (i(» +9)) r<t (»-i)) t. e __8_\ 
Thus fi(p)- 384 T(n-l) \ p + n p n(n + 2)J 


.(xviii). 
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We can now find the corresponding / m s from (x), 

T -14 J- rqn-i)r(n-i) r(j(w + i))r(t(n-D) 

# * 0 + F(n-i) 


\A 0 ^t 


r(j(» + i))(i-P*) 4(n ' 
r(*»-i>r(i(«-i)) 


° (1 -pt)Un-l) 

or applying the duplication formula: 

r (in -1) r (i (n - 1)) - Vir (n - 2)/2*- 3 .(xix), 

h-\A a2n - 3 - . (xx) ‘ 

Now by (iv) 1 **, f £ (e pv + e ~ w 4 "‘ 1 _ i («) d» 

* n 

j gi 71 " 1 

“zlpXin^T)’ 

and hence by (xx), 

_ ^(rc- 2 ) _ (xxi ) 

2 4 "- 1 r(pt-i)(i 
Turning back to (vii) we find 

«• ii e irr(»- 2) 

2"-^(l-p a ) 4 ' ,r - 1) ’ 

.N2" - 2 (l — p 2 ) 4 ( n_1 ^ 

or Ao= -Vf(n -2). . (XXU) ’ 

and therefore by (viii) 6 *, 

.<-“>• 

The curve for the frequency distribution ofv= * 8 now known from 

(vi) using (xix), namely, 

y = N p^- P 2 l (n : X) _ epv {„*« - 1 K, _, („)} .(xxiv). 

,y \Z7r2 4n - 1 r(H«-l)) 4 

Here the function of v in curled brackets is not to change sign with v but the 
power of the exponential does so. Such is the curve which has to be traced for 
samples of various sizes (n) from populations of various correlations p *. 

We next turn to I v By (x) and (xv), 

t u j- r ( in - l)T(n) ra(n-f 8))T(i(n~l)) D 

1 * 1 r(i(« + 3))(l-p*) 4(n+1) 2P (n — 1) P ‘ 

A A 1 p(n-l) 2g 1 g- 2 (l -p a ) p (n-l) 

4 4,2(1-/) « 2(1 — p 2 ) 

* For the special case p=0 , or of sampling from uncorrelated material, 

v= .__— JL - 
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But IjJIo = mean value of p n in samples or 


Pn‘ 


{l-l)a 1 <r t p = {l- 1 n )p n .(xxv). 


where Pn is the value in the sampled population *. 
We proceed to find By (x) and (xvi), 


T -x a jz r(j»-i)r(« + i )2 1 r(i(• + 5))r /. i\ 

l t -iAt v ’ r r ( i(n + 6 ))( 1 _ p ,) 4 („ + 3 ) 8 ~ f (n~— 1) •“~{ p + n)’ 


It ln(n — 1) 
/o = 4(l 


^ + ») t! + ») ,r ' W ‘ 


n — I 


Pn ’ 


j - P*n « —r ( l + P % )**** .( xxvi )* 

J. n fv 


Since v = , we have 

1 — p* 


j -ZTf .(xxvii), 

■ Vn- 1 .(xxviii), 

1 — p a 


which are the values used in dealing with the frequency curves. 

Next we obtain from (x) and (xvii), using (xix), 

Is 3 A z (n+ l)n(n-l) / , 1 \ 

Jo 32 A 0 (1 -?Y + n p )’ 

or p,,^' = Safo-t* n -^- (tp 9 + Q .(xxix). 

Hence by the formula 

P* = /*a' - 3mVi' + i 73 , 

and remembering fi 2 = fH wc find after reduction, 

p,i*> = |(l~^)p (p a + 3) <*•*,«.(xxx). 

We are now in a position to determine Pn fti. 

R = (a^) a „ _i_ p a (p a + 3^ ,„ vi , 

PlA —i (l +p a ) s . ( ) ' . 

The function of p 2 on the right attains its maximum when p 2 =l and then 
Pn 0i - 8/(w — 1). We see accordingly that there can be sensible skewness, or fi x of 
the order 0*1, when the size of the sample is of order 100. For samples of the 
order 400, j3 x would be of the order *02, and could be treated for practical purposes 
as negligible. For the smallest samples (n ^ 2 (!)), and high correlation (/>-*■ 1 (1))» 
we can have 8, which is a very high value indeed. 

# This is a special case of the value of j*n given some time baok by K. Pearson for sampling out of 
a limited population N t with any form of distribution (not necessarily normal), i.e. 
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Turning to I t we have from (x), (xviii) and (xix), 


^4= 1-^4 


V7rr(j«-1) r (n + 3 )4 .JL2 _1_ T( $ (n + 9» Tft (n - 1» 
Tft ( w + 9)) (1-/>»)*<" +7 > 384 


1 -4 4 (« + 2)(n + l)n(» — 1) 
To ~ 1C Ao (l^p 5 ) 4 


{ pt+ l pi + n(V^ 2))’ 


T(«-l) 

4 t+ ^’ + ;r(^h>)’ 


= a \ a i (l + *) (l + “) (* - J) (f + nP i + n{ n + 2]) . (XXXiI) - 

We must now transfer to the mean by aid of the formula 
/i4 = /i 4 ' - 4/a a >i 4- 6/* 2 > 1 ' a - 3/Xi' 4 , 
remembering that *>nM/ = 

We find Pll fi A = So-x^o-a 4 ^1 

Hence we deduce 




; )i((n+l)(l+p’)’ + V) ... 


/n + 1 8 p 2 ^ 

( YYYIV^ 

U — 1 «-l (1 + P 2 f) . 


-u v \ 


V ‘ (l +p*YJ .' 


Pnto 

• 3 + — 
n 

p n @2 reaches its maximum, 15, when n — 2 and p = 1. Its minimum is attained 
when n-> oo and it then takes the normal value 3. 


Equations (xxvi), (xxx), (xxxi), (xxxiii) and (xxxiv) have been reached by 
Mr J. Pepper as special cases for the normal surface distribution from his far more 
general expressions for the moments of p Ui when the sampling is made from any 
form of distribution*. The novelty of the present method is that the same results 
are obtained, by starting from the actual distribution curve, when the sampling is 
made from a bivariate normal surface. 

The question now arises as to when we shall arrive at an adequate distribution 
curve for p n by applying (xxv), (xxvi), (xxxi) and (xxxiv) to fit a four-moment 
leptokurtic curve to the data, i.e. how closely does the Bessel-function curve (xxiv) 
fit a Pearson curve of Types IV or VI ? The following illustrations indicate that 
for practical purposes the labour of computing the ordinates of the Bessel-function 
curve may be avoided for fairly moderate-sized samples. 

We start with samples of 50, and compare the true Bessel-function curve with 
the Pearson type curve for p = 0 and p = *6. These are shown in Figs. 1 and 2, and 
it will be seen that the accordance is excellent. We next passed to a sample of 
30 for p « 0, and the fit is again excellent, and if for p = 0, it will be so for higher 
values of p (see Fig. 3), Accordingly we proceeded to lower still further the size of 
the sample and reduced n to 22. For p = *9 and *6 (Figs. 4 and 5) the accordance 
is all that could be desired. But when we pass to lower values of the correlation, 
p =* *3 and p = *1 (Figs. 6 and 7) the agreement while for most purposes statistically 
adequate is losing its merits. We can therefore conclude that below n* 22, the 
* Dr Wiehart has also given the values of ft and ft for the distribution otp n in Biometrika, Vol. xxK p. 42. 
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Pearson curve with the same mean, standard deviation, and constants 0 t and 
as the Bessel-function curve will not suffice to describe the distribution of jp u . 
Illustrations of this are provided in Fig. 8 for n = 10 and p = 0, and still more 
effectively in Figs. 9 and 10 corresponding to n = 6, for p=*\5 and *3. It is 
feasible, however, to get an adequate description of the distribution of p u or v 
by a Pearson curve at least down to n = 10, if we drop the equality of j3 2 for the 
Pearson curve and the p n distribution and make our conditions the equality of the 
ordinates at oc *= 0, and of the standard deviations. This is illustrated for the case 
of p = 0 and n = 10 in Fig. 8. But the wiser course seems to be to table the 
ordinates of the p = 0 curves for samples of 2 to 25, and only use the Pearson- 
curve method for samples greater than 25. When p is not zero the ordinates 
of the asymmetrical curve can be obtained by multiplying by the factor 

The Tables now provided in the Appendix for the small samples has meant much 
work in computation because notwithstanding the labours of Professor Watson 
adequate tables of K m (v) for our purposes are not in existence*. It was then a great 
comfort when Professor Watson provided us with a suitable asymptotic formula for 
some of our purposes, the first term sufficing in many cases. This formula is as follows: 

u /Ve- m '/ r + F Vl + V , rM»r*^r(s + i) ^(-1)' 

" ( )_ V 2m (! + *.)* \ t ) (* m Vl+?r 

.(xxxv), 

where 

4 , J _1 5 a 77 385 1 

0 Al 8 24 (1 + t *) ’ 2 128 576 ( 14 - 1 *) + 3456 (1 + ‘ 

This may be written in the more compact form, if tan 6 = vjm = t , 

._ jfi 

*-<•>- v a* ‘ w? r T(jf HJm- 

.(xxxv)^, 

where A 0 = 1, A x = \ - ^ cos a 0, A 2 = T fa-$facoB 2 0 + ^cos 4 #. 

Confining our attention to the first term we have for our frequency curve from (xxiv), 

y = y Q e m (P* “ + (1 + Vl -f t*) m /( l + &)* .(xxxvi), 

where m — \n — 

* The highest value tabled is K l0 (v), and we require (v), or the largest possible sample n is 

one of 22. Even then the existing table for JSC 10 (v) provides only a small portion of the rise of the fre¬ 
quency curve for pu ; we do not get near the modal value and a fortiori do not get the fall of the curve. 
For m=10, we can use the recurrence formula 

and the tables of e v K 0 (v) and e v K\ (v) on pp. 698—718 of Professor Watson’s book. But these only carry 
us up to v = 16, a point short of the mode for the oase of a sample of 22. With samples of 60 and 
upwards, the labour involved in the use of the reourrenoe formula becomes prohibitive. It practioally 
amounts to oomputing new tables of K m (v), and extending the existing tables of Kq (v) and Ki (v) far 
beyond the range of the argument provided as v may run to 90 or even 200, while the frequency of pn 
is stiU quite sensible. 
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Taking logarithmic differentials, the mode of this curve is given by 


m 


( p </r+?) 


mi 


vT+W Vl + t*l + Vl + t* 2Vl + « a 


= 0 , 


or, if tan 6 = t f and 6 corresponds to mode, 

tan — p — 


sin 2 6 
4 m 


.(xxxvii). 


This equation admits of easy solution by successive approximations if m be 
fairly large. For example, to a first approximation, 

tan \6 = p t or t = tan 6 = 2p/(l — p 2 ); 
hence to a second approximation, 

p(W) 


which gives us 


or 


Mnvr 

i. ten 9-^(1 


.(xxxviii). 


As an illustration consider a sample of 22 from a population with correlation p 
of *6; we find 

v = 17*3713. 

Now by Equations (xxvii), (xxviii), (xxxi) and (xxxiv)*”* we have for the exact 
values, 

v = 19*6875, tr 9 = 8*350,243, 

& = *307,755, ^ 2 = 3-508,156. 

Hence by the Pearson-curve formula for the mode 

<Tv (/S 2 + 3) 


v = v — , 


.(xxxix), 


2(5A-6A-9) . 

we find 2 ; = 17*3301, 

or there is less than \ °/ a difference in the values given by (xxxviii) and (xxxix). 

As we have neglected terms of the order 1 /m z in obtaining (xxxviii), it is quite 
possible that (xxxix) really gives the modal value of v , with closer approximation 
than £ °/ c . It is clear, however, that (xxxix) will be of sufficient accuracy for most 
statistical purposes, when the sample is as large as 22. 

But the reader must not expect to get equal agreement when p is small. Thus 
when p = 0*1, (xxxviii) gives v = 1*8202, but from the exact equations as before, 

v = 2121,212, a v = 4*651,951, 

= -016,750, yS 2 = 3*296,918, 
whence from (xxxix) v = 1*8645, 

or, the two values differ by somewhat over 2 °/ 0 in the position of the mode. 
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It is worth while considering whether this arises from not having gone far 
enough with the solution of (xxxvii) by approximation, or from our determination 
of the mode being based only on the first term of Professor Watson's series. Now 
the actual solution of (xxxvii) is 0 = 1O°24'*719 which gives v = 1*8375, which 
differs from i> = 1*8645 by a little less than 1*5 %• This is not very serious for 
most statistical purposes, but it indicates that (xxxvii) and (xxxix) for low values 
of p and small samples cannot be brought closer into accord without considering 
higher order terms in one or both of them. 

If we take into account the A 1 torm in Professor Watson's formula and develop 
the equation for the mode as far as terms in 1 /m a , we have 

tan \6 = p - (l + °l S J (5 cos 2 0-1)) .(xxxvii)** 

instead of (xxxvii). Now it is clear if p be small, e.g. =01, 0 will be small and 
accordingly cos 0 approach unity, and the corrective term accordingly will be positive, 
approaching 1 1 ^. Thus 0 will be slightly reduced instead of increased by this first 
correction. It is not till 0 = about 63° 25' that the term in l/?a 2 changes sign, or 
roughly when p is about 072. The agreement of (xxxvii) and (xxxix) in our first 
illustration may very likely be associated with the relative smallness of the term 
in l/m 2 when p = 0*G notwithstanding the smallness of m . Unless p be fairly large 
and m of the order of 50 or more, it does not seem probable that a satisfactory 
formula for the mode or the most probable value of p n can be found from (xxxv). 


The true mode v would be determined by 

p + m/v + K m ' (v)/K m (v) = 0 .(xl). 

In order to obtain a reasonable value for v by tabling, we retufn to Equation (xxiv) 
and write it in the form 

y = N{\ - P 2 )* : 1 ) r T in _! (V) .(xli). 


Here the function T^ n „ l (v)= T m (v ) does not contain p } and accordingly, if tabled, 
will be the same for samples of the same size n whatever be the value of p. When 
p = 0, then y~NT^ v „j(v) will give the frequency curve of p n for a sample of 
size w, by simply multiplying by the number N of samples. Our Table in the 
Appendix gives the values of T m ( v) and log T m ( v) for v = 0 to v = 120 and for m ~ 0 
to 11 * 5 , the latter by intervals of 0 ' 5 , or from n = 2 to n = 25 in the case of sampling*. 
The interval of v in order to make the table publishable had to be varied. Thus 
from m = 0 to 5*5 the table proceeds by intervals of 0*1 as the argument v passes 
from 0 to 4 0 ; from m = 6 to 11 * 5 , the argument increases by intervals of 0 * 5 ; for 
all values of m, the argument alters by 0’5 from 4*0 to 16*0, by 2*0 from 16*0 to 
40*0 and by 5*0 from 40*0 to 120*0. 

* It is easy to deduoe that 

= T^ie~ v ( l+o), r 4 (v) = 1 Ve- v (i;2 + 8v + 8), etc. 

The successive T n+ j may be found from the formula 

r »+| W = + W + 2(«+Ij 2 * +i ' 
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T m ( v ) = 


T m (0) = ; 


v m K m (v) .(xlii) 


.(xliii). 


Clearly T m (v) = i ^ rJ^J) .< xlii > 

and since the value of v m K m {v) when v = 0 is 2 m “ i r(rn), it follows that 

Tm ■ 2V^F(w +\) . (xhu ^ 

The reduction formula, from which the Appendix Table has been computed 
(the highest function T m (v) being independently computed in order to act as 
a check), is as follows: 

JW. <»> - i r— W+sJtt t - <•>.<*“'>• 

Making use of the formula* 

K m -i(v) + Km+l (v)--2K m '(v), 
the equation (xl) becomes 

m 1 K m ~, (?) + if m+I (?) 

9 + « = 2 JT*(«j' 

j[m+ l (0_ (y ) 

T m (v) -2m + l K m {v)- 


P + -* 


Hence 


or, using (xliv), 


m_i2»i+ir„ +1 (i) i « r—i(Q 

P v 2 v ~ + 2 2»t — 1 T m (?) ’ 

V T m ^(v) 

9 2m -1 T m (v) . 


Accordingly if a table be formed of the expression on the right of (xlv) for each 
value of v and m , it will give the value of p for which, with the corresponding value 
of m, the value of v gives the mode. Consequently by backward interpolation from 
this table we can find for a given value of m and a given value of p the modal 
value of the ordinate or v. We have provided the needful entries in the third 
column of each section of the table of T m (v). Thus the calculation of the true 
modal value now becomes fairly easy. 

We reach a specially interesting case when p = 0, or we are sampling from 
uncorrelated material. In this case the distribution curve for p u is symmetrical 
about v = 0, while we have from (xxviii), (xxxi) and (xxxiv)**, 


r v a — n — 1, pn& i — 0, pu &2 — 3 4* 


ft — 1 ’ 


The appropriate curve is 


V = 


K)’ 


where q - $ (5y9* - 9)/(£ g - 3) => J (n + 4), 

a a = 0 2 2&/(/3 2 -3) = n a -], 

n r ( P )_ n r p(»+4 » 

V2w«r V(p — i)T(/> — J) V(n*— l)w r*(}(» + 8)) 

* Watson, Theory of Bessel Functions , p. 79. 
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Thus the equation to the required curve having the same first four moments 
as the frequency curve for p u is 

N r(l(n + 4)) 1 , , . x 

y=, 'Vp3T ) r(|(^8 > ) 7- TTKTJi .<*>."> 

The true curve of frequency is 

y ~ 7Z& 1T f< k» - i )) 1 1 <•> . 

We shall consider later for what value of n these two curves for statistical 
purposes become practically identical 

Another point which it is of interest to consider is the relation of the 
distributions of p n to those of r. In Biometrika, Vol. XI. pp. 379—403, are given 
the frequency curves for various distributions of r in the case of samples of various 
sizes. The fix and of such distributions have been tabled. 

Now p n « hence 

v =-- 

1 — fr cr x a 2 

or, v would be proportional to r, if we neglected the variation of % lf X 2 or treated 
SiSj/cTjo-g as a constant. This is what Dr Wishart has done* when he replaces the 
true tetrad 


by 


r tu r w 


S. S.S.S, 

cr 8 €t u ar v 


( r m r tv - r tu r n ), 


and supposes the variation of the latter to be sensibly the same as the variation of 
the former, i.e. he is neglecting the variation of the standard deviations of the 
samples. 

Now the value of v only differs from that of r a by a constant and we can 
at once compare the fix and fi 2 of v or P 11 I<t\<t 2 with those of r by means of 
Equations (xxxi) and (xxxiv) b *\ Table I on p. 186 gives the comparative values. 


The values of fi 2 for p * 0 indicate clearly the difference of the r and " r 


<r l <7 2 


distributions. The former has a platykurtic and the latter a leptokurtic curve and 
the distributions approach the normal curve from opposite sides, the difference 
between them being very marked for small samples. When we reach p = *6, how¬ 
ever, both curves have become leptokurtic—the transition for the r-distribution 
takes place before p = ’45—and the fi 2 for the r-curve rises with great rapidity and 
reaches remarkable values for high correlations in the sampled population. As the 
A for the r-curve rises much above that for the v-curve so does its skewness. An 
examination of the column for p =* *9 will show how little can be inferred from the 


distribution of 


*i<r 2 


r as to the distribution of r. It is not, however, only the shapes 


of the curves as judged from fix, fi 2 which diverge widely; the position and spread 


* British Journal of Psychology , Vol xxx. p. 188. 
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TABLE I. 


y9i and for r and v . 


Size of 

/> = 0*0 

P = 

*3 

P- 

•6 

P~ 

*9 

Sample 

r 

V 

r 

V 

r 

V 

r 

V 

r, (01 

*0000 

•oooo 

*3077 

•6636 

1 -7207 

1-6157 

13*0290 

1*8847 

102 

2*0000 

4*5000 

2*4201 

4-9954 

4*4027 

5*6678 

21*7579 

5-9835 


*0000 

•oooo 

*2317 

*2949 

1-2002 

*7181 

5-7475 

*8376 

2*4545 

3-0667 

2*8292 

3-8860 

4*4598 

4-1857 

13*6667 

4*3260 

"■{& 

•0000 

*0000 

•1745 

*1896 

•8473 

•4616 

3*0956 

*5385 

2*6250 

3*4286 

2*9265 

3*5701 

1*1375 

3*7622 

8*8548 

3-8524 

*'{£ 

•0000 

•oooo 

*1386 

*1397 

*6464 

•3401 

2*0603 

1 

*3968 ! 

2*7143 

3*3158 

2*9623 

3*4201 

3*9055 

3*5616 

6*8681 

j 

3*6281 1 


*0000 

•oooo 

•1146 

*1106 

•5203 

*2693 

1-5334 

•3141 1 

2*7692 

3*2500 

2-9788 

3*3326 

3*7453 

3*4446 

5-8584 

3-4972 | 

“ {?; 

•0000 

•oooo 

•0611 

*0542 

*2611 

*1319 

•5004 

•1539 

2*8824 

3-1224 

2-9991 

3 1629 

3*3891 

3*2178 

3-8731 

3-2435 

,o ° {& 

*0000 

•oooo 

•0315 

*0268 

•1303 

'0653 

•3115 

*0761 

2*9400 

3-0606 

3*0021 

3-0806 

3*1974 

3*1078 

3-5790 

3*1205 


as determined by the means and standard deviations are very different when there 
is considerable correlation, say above 0*4 or 05, in the sampled population. The 
means and standard deviations given in Table II illustrate this. For those upon 
whom a graph has a greater impressional value than a table, Figs. 11, 12 and 13 
are provided; they correspond to samples of 20, 10, and 5 from populations with 
correlations of 0*6, 0*0 and 0*9 respectively. These values were originally selected 
to determine a proper range of argument to give to our v in the Table of T m (v)*. 

* It may be suggested that tho divergence of the two carves would be less if we took much larger 
samples. That this is not so the following values of the standard deviations indicate for p=0*5: 

Size of Sample 

__A-— 

50 100 400 

Standard Deviation of r *108,620 *075,897 *037,612 

P - u *156,525 *111,243 

(Tiff 2 

In each case the standard deviation of is about 50 °L increase on that of r. 

*1*2 


*055,832 
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TABLE II. 


Means and Standard Deviations of r and 


- r . 

a t (r % 


Size of 
Sample 

II 

o 

© 

p = 0’8 

p=0*6 

p=0’9 

r 

<r,<r a 

r 

Zl ?, r 
a 

r 

S,2 a 

— - r 

<Ti<r 2 

r 

<T \ ff a 

* fMean 

6 \S.D. 

*0000 

*5000 

•0000 

*4000 

•2671 

•4740 

•2400 

•4176 

•5480 

•3858 

•4800 

•4665 

•8687 

•1748 

•7200 

•5381 

/Mean 

lU \S.D. 

1K /Mean 
10 \S.D. 

*00(X) 

*3000 

•0000 

•3000 

•2860 

•3103 

•2700 

•3132 

•5776 

•2355 

•5400 

•3499 

•8887 

•0832 

•8100 

•4036 

*0000 

*2673 

•0000 

•2494 

*2903 

•2470 

•2833 

•2604 

•5858 

•1828 

•5667 

•2909 

•8932 

•0602 

•8500 

•3356 

90 /Mean 
20 \S.I). 

*0000 

*2294 

•0000 

•2179 

•2928 
•2113 | 

*2850 

•2275 

5896 

•1543 

*5700 

*2542 

•8951 

•0493 

•8550 

•2932 

fMean 
25 \S.D. 

*0000 

•2041 

*0000 

•1960 

•2943 

•1875 

•2880 

•2046 

*5918 

•1359 

•5760 

•2285 

•8962 

•0427 

•8640 

•2636 

»{“r 

•0000 

•1429 

*0000 

•1400 

•2972 

•1306 

•2940 

•1462 

•5960 

•0933 

•5880 

•1633 

•8982 

•0284 

•8820 

•1884 


•(XXX) 

•1005 

•0000 

•0995 

•2986 

•0917 

■2970 

•1039 

*5980 

•0650 

•6940 

•1160 

•8991 j 
•0195 

•8910 

•1339 


A word may be said here as to the method of using the Table. What we usually 
need is the distribution curve of Pn/cri<r 2 or (1 — p 2 )v/n . What is tabled is, to each 
value of the argument v t the value of the function 

= T m ( v ) of our equation (xlii). 


Here m is to be obtained from the size of sample n by the relation m — \n — 1. 

But the actual frequency curve of v, if there be a correlation p and N 
samples, is 

y v = N(l-p*r + h <f v T m (v). 

Hence to obtain the ordinates of the v-curve we must multiply T m (v) taken from 
the Appendix Table by N(l — p a ) m + * e fiV . This can be done by using the values of 
log T m ( v ), and adding the logs of N(1 — p*) m+ 1 and p log e x v ; or often more simply 
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by using Newman and Glaisher’s Tables of & and e~ x in the Cambridge Philosophical 
Transactions*. Writing R for Pul( <T i a ‘z) we need the ordinates y R of the frequency 
curve for 22. Clearly 22 and v must have their elements of frequency the same, or 

y R dR = y v dv = y v ~ n dR t 
1 ~P 

or 

Thus y R - N x n (1 - p*) m ~ (v). 

Accordingly, to get the frequency distribution of 22, we have to plot to the 
values of 22 = (1 — p a ) vjn t the values of y Ry or the T m (v) of our Table multiplied by 
N x n x (1 — p*) m ~^eP v . Of course v must be given both negative and positive 
values in the factor e pv , but the value of T m (v) is the same for v positive or 
negative. Thus by aid of the Table of T m ( v ), it is relatively easy to obtain, what¬ 
ever be the value of p, the distribution curve of 22 = p 11 /o- 1 <r a , for small samples, 
i.e. n < 25. When n > 25, the Pearson curve obtained from the v , cr v , and v f3 2 of 
Equations (xxvii), (xxviii), (xxxi)and (xxxiv) 6i ® will be amply sufficient to describe 
the distribution. 


From our Tables I and II, and again from the examples we have given graphically 
(Figs. 11—13), it seems impossible to predict from the distribution of a product- 
moment what is likely to be the distribution of a correlation-coefficient. It is 
equally—or rather more—unlikely that we can learn anything about the distribution 
of a function of correlation-coefficients from the distribution of the corresponding 
product-moments. For example, it is highly improbable that the distribution of 
tetrads, r m r tv — r tu r tv , will have even approximately the same distribution curve as 


Pn ( m )Pn (fa) ~ Pn(tu)Pn(w ) _ S , 

o‘ 8 c‘tcr u cr v O’gO'tO'u&v 


(Vtu^tv 't'tuVgv). 


Of course we may start ab initio with either of these expressions, but having done so, 
we cannot apply the standard deviation of the one to the consideration of goodness 
of fit of theory to observation in the case of the other. The extreme leptokurtosis of 
the p n curves, while the r-curves are of limited range and of much less variation (see 
our Table II), suggests that for practical purposes it would be advantageous to use 
correlation-coefficient tetrads rather than the product-moment tetrads. There are 
other interesting differences between the r and pnl<r x a 2 curves. The r-curves have 
all negative skewness, i.e. the mode is greater than the moan when a true mode 
exists, but in the case of the jpn/^i^a curves the skewness is positive or the mode 
is less than the mean. For samples of two the r-curve consists of two concentrated 
frequency lumps (r = ± 1), for samples of three of CT-curves, and for samples of four 
J-like curves—all these without true modes. In the corresponding cases for p n we 
have always continuous curves; in the case of samples of two we have a double 
J-curve; if p be not zero we have two unequal /-curves, set back to back along the 
asymptote; in the case of samples of three we have, if p be not zero, a double 

* Vol. xzn. 1888, pp. 145—272. 
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exponential curve, consisting of two unequal exponential curves of the same maximum 
height at pn « 0. Hence for samples of two and three no modal values are given 
in our Table for the different values of p. For samples of three the value of the 
ordinate at the vertical for v « 0 of the exponential curves is ’5. 

Dr Wishart*, as we have said, has obtained the standard deviation c P of the 
product-moment tetrad ; if a F be the standard deviation of the correlation-coefficient 
tetrad, Dr Wishart really assumes that 

°"p/( cr « t<r u <rv) 

is interchangeable with a F . But in doing this he is neglecting the variability of 
2 s £*2 w St,, and our present discussion shows that in a like case the variability of 
pn/o' 2 is much greater than that of r. Dr Wishart has also overlooked the fact 
that if the sample values are inserted in the formula for o>, since these values differ 
from the sampled population values by terms of the order 1/ViV, it is idle to 
introduce the terms in 1/N, whether they be those he has himself found or those 
suggested by Spearman and Holzinger, in order to correct the first approximationf. 
Clearly using <r F + a 8 <T t <r u cr v instead of o> will give a much larger variability to the 
tetrad. Dr Wishart J applies it to Holzingcr’s data and states that “ the discrepancy 
of the observations from the two-factor theory...is seen to disappear” when his 
value of a P is used; and again: “The extraordinarily close agreement...may be 
regarded as fortuitous, but we can at least say that in one example we have 
obtained striking confirmation of the two-factor theory.” The “ discrepancy ”§ is of 
course likely to disappear if <r P is essentially greater than and we fear that 
emphasis must be laid on the word “ fortuitous,” if an imperfect theory is applied to 
link up an hypothesis with observation. Dr Wishart has suggested a new manner 
of dealing with the two-factor problem, namely, by using product-moment tetrads 
instead of correlation-coefficient tetrads, on the ground that the standard deviation 
of the fonner tetrad is accurately known. At first sight this appears to have 
advantages, but when we remember that we have to use in the formulae the 
sampled population values, and therefore terms of the second order cannot be 
accurately determined at all, and when we note further the extrenu variability of 
product-moments (divided of course by <r-products) as compared with correlation- 
coefficient variabilities, we see that the proposed method has its disadvantages. 
Further, we cannot admit that a formula thus obtained is in any way applicable to 
correlation-coefficient tetrads as Dr Wishart applies it. We must de novo compute 
the product-moment tetrads. Professor Spearman apparently holds Dr Wishart’s 

* The British Journal of Psychology , Vol. xix. p. 183. 

f See Pearson and Moul, “The Mathematics of Intelligence,” Biometrika , Vol. xix. p. 251. 

} Loc. cit. p. 188. 

§ We would draw attention to the fact that in the paper by Pearson and Monl it was not asserted that 
there was a discrepancy between theory and observation; the difference was shown to be 2*11 times its 
probable error, and the conclusion drawn that this did not indicate either “a very good or vezy bad 
aooordanoe between theory and observation.” What it certainly .does not justify is the remark of Spearman 
and Holzinger that “in this case at any rate every one of the abilities can be resolved into two in¬ 
dependent factors, the one being always specific and the other throughout oommon ” (The British Journal 
of Psychology , Vol. xvi. p. 88). 
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formula when applied to correlation-coefficient tetrads to be wholly satisfactory, 
and to include “ all the terms in ; he fails to realise that the variability of F 
must be essentially less than that of JP*. 

Apart from the illustration given above of the dangers which may attend the 
application of the standard error of one quantity to another which bears some 
resemblance to it, the present paper appears to us to indicate that the distribution 
of product-moments, while in many cases less skew than that of correlation-co- 
efficients, has in numerous instances such a wide range and low modal frequency 
that it cannot replace the direct consideration of the correlation-coefficient. At 
the same time the distribution of the product moment-coefficient introduces into 
statistics the study of functions based on the Bessel functions of imaginary argu¬ 
ment. In a paper by Dr Fisherf, Bessel functions have been also introduced in 
the case of the distribution of the multiple correlation-coefficient. It is interesting 
to note how, as statistical theory advances, even on the basis of simple normal dis¬ 
tributions, we call into requisition more and more functions familiar in physics. 

As a general statement of the results of this paper we may say that the 
distribution of pu has been ascertained theoretically when the sampling is from 
a normal population, and tables have been provided for tracing the curve of 
distribution of pu , up to samples of 25. Furtherjt has been shown that good fits 
are obtained for samples greater than 25 by use of a Pearson curve with the 
appropriate moment-coefficients. This gives us confidence in holding that for 
samples in excess of 25 the distribution of p llt with the use of the more general 
moments of that coefficient determined for any form of sampled population by 
Mr Pepper, will also be effectively described by a Pear-son curve. 


* See The British Journal of Psychology, VoJ. xix. p. 101. 
t B.S. Proc. A, Vol. cxxi. p. 663. 
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TABLE OF THE PRODUCT MOMENT T m FUNCTION. 

Computed by ETHEL M. ELDERTON. 

The figures in round brackets in the T m columns give the number of zeros 
between the decimal place and the first integer recorded. The logarithms of T m have 
been obtained, not from the six-figure value of T m in the T m column, but from the 
fuller computed value before it was cut down. 

The third column headed p gives the value of the argument v, for which, with 
that value of p, the argument v would be the mode v of the distribution curve. 

If v a» , then we have for the distribution curve of v : 

1 — p* o- x <j 2 

y r = N (1 - p*y n+ i e^T m (v), 

or log y v = log N+ (m + £) log (1 - p 2 ) 4- pv log„ + log T m (v). 

Here T m ( v) and its logarithm do not change sign with v. Thus the ordinates of 
the curve of distribution of p lx in samples of size ft, (m = £ft — 1), are easily computed 
from the log T m column. 

When ft is equal to or exceeds 25, the distribution curve is adequately given by 
a Pearson curve with the Mean, S.D., and the y9i and /3 Z determined by Equations 
(xxv), (xxviii), (xxxi) and (xxxiv). 
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LAPLACE, being Extracts from Lectures delivered by 
Karl Pearson . 


In preparing a course of lectures in the spring of this year on the “ Life and 
Work of Laplace,” as part of a more general series on the History of Statistics and 
the Theory of Probability, I was astonished to find how meagre were the French 
accounts of Laplace's family, boyhood, education and personal opinions. I was still 
more surprised to discover statements made—often to his discredit—by the 
smaller historians of mathematics not only in this country, but abroad and even 
in France itself, which were most certainly in grave error. 

In the biographies we are told that he was the son of a peasant, who dwelt in 
Beaumont-en-Auge in the department of Calvados in Western Normandy, that he 
went as a day-boy to the “military school” of Beaumont-en-Auge and afterwards 
became a "professeur provisoire” there; find that he owed his education to “the 
interest excited by his lively parts in some persons of position.” Rouse Ball citing 
no authority, and probably having none, states that Laplace was son of a labourer 
and that “from a pupil he became an usher in the school at Beaumont*.” We are 
even told that the biographers' ignorance of the great mathematician's early life 
was due to Laplace's false shame of springing from such humble origins. We are 
also informed that Laplace was a time-server seeking honours and political advance¬ 
ment by flattering the successive French rulers—again with no evidential proof. At 
least, our friends, the smaller historians, might have taken the trouble to inquire 
whether there was a “military school” in Beaumont-en-Auge before the year 1771 
when Laplace left home for Paris, they might have ascertained the position of 
Laplace's father, and taken the pains to find out whether Poisson's statement that 
Laplace was educated at the University of Caenf was or was not correct 1 While 
the facts I have now the honour of publishing and which I owe to the great kind- 

* A Short History of Mathematics , p. 883. 

t Henry VI of England, not content with his colleges at Eton and Cambridge, blessed Caen with 
a University in 1486; to thiB University, the later one was a successor. Caen was .the burial place of 
William the Conqueror, and from the earliest date his Castle was and is still a military centre for the 
district. In the eighteenth century it possessed four learned societies, two of which were royal foundations. 
The Caen Royal Academy started its publications in 1754, five years after Laplace's birth, and, except 
when its sohools and academies were suspended for supporting the Girondists, has continued to be 
a centre of art, military and civil education, justice and administration for Lower Normandy. Its streets 
today are adorned with the statues of Laplace, Malherbe and Ill lie de Beaumont. A Jesuit College, 
ancient seminaries and schools existed in Caen in the eighteenth century beside the University and the 
military school. Indeed Caen was probably in Laplace's day the most intellectually aotive of all the 
towns of Normandy. It was here that Laplace was educated and was provisionally a professor. It was 
here he wrote his first paper published in the Melanges of the Royal Society of Turin, Tome rv. 1766- 
1769, at least two years before he went at 22 or 28 to Paris in 1771. Thus before he was 20 he was in 
touch with Lagrange in Turin. He did not go to Paris a raw self-taught country lad with only a peasant 
background! In 1765 at the age of sixteen Laplaoe left the “School of the Duke of Orleans" in 
Beaumont and went to the University of Caen, where he appears to have studied for five years. The 
“liloole militaire" of Beaumont did not replace the old school until 1776. 
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ness of M. l’Abb6 Simon and M. le Comte de Colberfc-Laplace, the illustrious 
mathematician’s great-great-grandson, dispel the crude misstatements as to Laplace's 
family and education*, I have ventured to add to them a few extracts from my 

* The following unpublished details with regard to Laplaee may fitly find a place here; they are 
contained in a letter of M. le Comte de Oolbert-Laplaoe to the lecturer, 

Orbec, 8 Rue Grande, 16 fdvrier 1929. 

Monsieur, Vous mo voyez trds perplexe. L’incendie qui a ddtruit radicalement le eb&teau de Mailloe, 
en 1925, a andanti tons mes papiers de famille. C’est aveo une peine, que vous pouvez imaginer, que j’ai 
vu disparaitre la correspondanoe de Lagrange et de Laplace que je voulais publier (Laplace y tenait 
oertainement beauooup, puisqu’il avait conserve le double de ses propres lettrea), et deux grands volumes 
de correspondanoe aveo les savants europeens, dont beauooup d’anglais. 

J’ai cependant eu le bonheur de connaltre, jusqu’en 1889, m& grand’mdre, nde en 1813, orpheline 
de mdre & sa naissance, Angdlique de Portes, marquise de Colbert'Chabanais, qui avait dtd dlevde par ses 
grandsparents Laplace de 1813 k 1827 et avait fort bien oonnu Laplace. 

Je t&oherai, Monsieur, de ooordonner raes souvenirs, mais ce sont des souvenirs de sonvenirs, et 
par consequent ils peuvent bien avoir dtd ddformds. Je possddais aussi des papiers relatifs k la famille 
Laplace (L’orthographe variable: Delaplaoe, de la Place, de Laplace s’y trouvait). C’dtaient des comptes 
de tutelle, et des papiers relatifs 4 la propriety du Mdrisier k Beaumont-en-Auge que je possdde encore. 
Ces papiers dont une grande partie remontaient k 17— (1720, je crois), bien difficiles k lire, m’ont paru 
indiquer qu’il une oertaine dpoque la famille Laplace avait fieffd k d’autres cette propridtd. Laplace 
l’avait rachete k la Revolution, et nous la possddons depuis k la fagon moderne des propriety. C’est 
ce qui me rend trds inddois sur le lieu de naissance de Laplaee. Dans mon enfanee, le fermier montrait 
un cabinet sombre, et peu con for table, dans la tnaison du Mdrisier, comme dtant le lieu de naissance 
de Laplaee. Beaumont-on-Auge a appose une plaque de marbre sur une maison du bourg, indiquant 
qu’elle etait la maison natale de ee savant. Je ne deciderai pas. Je puis vous dire cependant, que j’ai 
vu rire mon regrette pdre de la tradition qui faisait naitre mon aieul dans un reduit obsour, alors que 
la maison possedait et possdde enoore des chambres oonfortables. 

Je n’ai plus les noms exacts de son pere, sa mere s’appelait Sochon. J’ai lu, dans un ouvrage de 
la Societe liistoriqne de Lisieux, qu’elle appartenait k une famille anoienne et distingude. Mais je ne sais 
pas au juste ce qui peut en dtre de eette vanitd, k laquelle ni Laplace ni aucun de nous ne pouvaient 
donner d’importance. Pour moi, j’ai la certitude que la famille de Laplace etait une bonne famille du 
pays, comme il y en a beauooup enoore. Je ne sais pas si l’on peut dire que c’etait une famille de 
cultivateurs. De 1700 k 1750, les Laplace se sont occupes de culture de la terre, mais j’ai vu par 
les papiers que le pdre de Laplace s’occupait du commerce des oidres—ce qui n’exolue pas. k la vdritd, 
l’oooupation de cultiver la terre. Ce pdre dtait Syndic de Beaumont en 17—, ce qui indique qu’il dtait 
dans une bonne situation. U y avait un oncle curd, j’ignore s’il dtait du cdtd paternel ou maternel 
[paternel, Louis de Laplace. Voyez la gdndalogie]; et c’est parce que la famille voulait un prdtre, que 
le jeune Pierre-Simon Laplaoc fit d’abord ses premieres etudes en ce sens.... 

Une publication, j Revue illuntrtie du Calvados , interrompue par la guerre, a public en 1912 ou 1918 
une dtuae sur le prieurd de Beaumont; on y parlait du sdjour de Laplace. Peut-dtre, M. Moridre, 
direoteur du journal Le Lenovien, rue du Bouteiller k Lisieux, possdde-t-il enoore des exemplaires de 
oe numdro. En tout oas, Monsieur l’Abbd Simon, curd de Montreuil-en-Auge, president de la Sooidtd 
historique de Lisieux, a dcrit quelques articles sur Laplace et Beaumont. 

Vous me permettrez. Monsieur, de faire appel k mes souvenirs d’enfant. Nous sommes arrives k un 

E rint de la vie de Laplace, oil la precision n’est gadre possible. Par suite de quelles oireonstanoes 
aplace eBt-il venu k Paris oil a commence sa brillante earridre? Voici, oe que ma grand’mdre me 
racontait vers 1884, pendant un de oes longs trajets en voiture entre Lisieux et Mailloe qu’elle faisait 
deux fois par semaine. Muni d’une lettre de recommandation pour d’Alembert, (j’ai oublid de qui dtait 
cette lettre) Laplace se rendit k Paris. D’Alembert, qui devait dtre assailli par de nombreuses lettrea de 
oe genre, le regut assez mal, et pour s’en ddbarrasser lui remit un livre assez gros, peut-dtre ses derniers 
travaux, en lui disant de revenir quand il l'aurait lu. Laplace revint quelques jours plus tard, et trouva 
d’Alembert enoore moins aimable que la premiere fois, et il ne lui cacha pas qu’il trouvait impossible 
qu’il ait pu lire et oomprendre l’ouvrage prdtd. Je ne me souviens plus par quelle transition d’Alembert 
fut amend k interroger le jeune Laplace sur cette lecture, le rdsultat fut qu’il fut d’abord dtonnd, puis 
intdressd. Il passait pour certain chez moi, que d’Alembert se serait k oe moment occupd de Laplace. 
Je ne suis pas au oourant du ourrioulum vitae de mon aieul jusqu’d la Rdvolution. Il s’dtait marid 
probablement peu de temps auparavant. Tons mes papiers dtant brtilds, je ne puis faire que de con¬ 
jectures. Sa femme, mon arridre-arridre-grand’mdre dtait d’une famille de Besangon, de magistrature; 
et s’appelait de Courty de'Romanges—enoore sur oe point j’hdsite et ne suis stir que d’une ohose, c’est 
que l’onole Here ale de Courty, qui par son dnergie a refait la fortune de la famille, devait dtre le frdre de 
Madame de Laplace—car comment [autrement] aurait-il pu dtre le grand-onole de ma grand’mdre? Cette 
famille Courty nous avait laisad des portraits de parents—qui s’appelaient de Mollerat. Je donne oes 
renseignements qui pourront peut-dtre alguiller vos rechercb.es. 

H a dtdpublid, par M. Marmottan, 15 mars, 1922, un volume des lettres de Madame de Laplace k la 
prinoesse Elisa, dont elle fut dame d’honneur. Le mdnage Laplace eut deux enfants: (i) Sophie de Laplaee , 
nde en 1787, qui dpousa le marquis de Portes, dont une fille, Angdlique de Portes, maride vers 183d an 
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lecture notes wherein I have endeavoured to see in its true light the accusation of 
time-serving made against Laplace. I had been discussing in my lecture the 
charge made against Laplace, namely that he did not adequately acknowledge the 
work of other men; that it was unlikely that he could have failed to see either 
Thomas Wright of Durham s Original Theory of the Universe, 1726, or Kant’s 

M l "de Colbert-Chabunuis, mon grand-pore. (ii) Emile de Laplace , 15 avril, 1789, el4ve de polytechnique 
et de l’Kcole de Metz, qui mourut general (nomratf en 1848), le 27 octobre 1874. Laplace <Hait ami de 
Bailly. Si mes souvenirs nont exacts, ii liabitait alors 4 Paris, dans la maison qu’on appelle pavilion de 
Hanovre, chez M. Arthur (? un anglais (tout h fait sous reserve)). 

Quelles 4taient ses idoes 4 oette epoquo ? II dut probablement penser comnie ses contemporains. Les 
ideeB de 1789 devaient correspondre 4 son ideal de justice. Comme me l’a dit mon p4re, Laplace 4tait 
passioiinfi de justice, et il aurait dit quo sa creance en Lieu deriva.it de cette idee de justice. Mais la 
Involution tWolua, ct il se trouva en butte aux tracasseries. Ma tan to, la D MO de la Koohefoucauld 
Doudeauville, me racontait comment utie visite domiciliaire avait trouv6, dans la chambro de la petite 
bonne qu’ils avaient, une image do piete clout>e au mur, et les ennuis qui en survinront. 

Toujours est-il quo Laplace et les siens quitterent Paris, et allerent du cote de Melun, aux M4es (si 
j’ai bon souvenir). J’ai eu en ma possession plusieurs certificats de civisme ou de presence, qui lui furent 
dAlivr4s. J’ai oublie la date exacte de ces documents maintenant detruits par le leu. 

Je crois avoir entendu dire 4 mon pere, que Bailly fut arr^te lorsqu’il se rendait cbez Laplace. 
Comment, dans la suite, Bonaparte fit-il connaissance de Laplace, je l’ignore, mais ce qui me parait 
ccitain, e’est qu’il y eut des rapports d’amitn'* entie ces deux hommes. .Je possedais des petits cadeaux 
du G“ l Bonaparte qui semblent prouver ce que j’avanee. IJne livrc de cafe moka, un c6drat, un 
cachemire, des bouteilles de vin de Constance, donnes par lo G" 1 4 son retour d’Egypte. Comment 
encore, aprds le 18 brumairc, Bonaparte nomma-t-il Laplace min intro de l’lnterieur? C’otait une 
fonction tout 4 fait en deliois des aptitudes du savant. Aussi Bonaparte par un billet; du. .rendait-il 
Laplace 4 ses occupations soientifiques. Je pense que Bonaparte voulait Hatter l’lnstitut, et faire tenir 
provisoirement une place qu’il dcstinait 4 son fr6re. 

Laplace avait aelicto de liewbell une maison 4 Arcueil. Je l’ai liabitee dans mon cnfance. Il y avait 
un pare superbe, ct la maison <Hait tres grande et fort belle. Berthollet tUait son voisin. 11s se rtfunisBaient 
sou vent avec d’autres savants, et de cette societb sortirent quolques travaux, qui furent publics sous le 
noin de Travaux de la Hoci4t6 de Arcueil. On y dinait en famille, un de ces savants M. Bouvard y tenait 
une place speciale. Il faisait enrager ma grand’mdre, parce qu’il avait la mauvaise habitude de cracher 
4 terre—elle le grondait, puis on disait “AllonH, M. Bouvard, la main aux dames,” et en boitant, 
M. Bouvard ollrait la main a Madame de Laplace pour passer 4 table, oh suivant t’usago d’alors, elle 
servait le potage 4 tout le monde. Je Bais que M. Magendie venait aussi 4 Arcueil. A la mort de 
ma grand’mere en 1889, no pouvant conserver cette propria, elle fut vendue, suivant le d4sir exprim6 
par le G nl Laplace, aux dominicains qui posR^daient dd?j4 la maison Berthollet qt y avaient fond6 un 
college. La loi des congregations a fait passer cette propriety dans les mains de l’Etat, qui a loti le pare 
magnifique. Je n’y suis jamais retouruc. 

De souvenirs savants, je no puis en avoir. Il faut neanmoins que je vous raconto ce que ma 
grand’mere avait rctenue. Laplace, parait-il, <$tait tout 4 fait contraire 4 l’id&j, que les changements de la 
lune ont une influence sur la temperature. Ma grand’more, qui disait tenir cette opinion delui, y tenait 
fermement et me rabrouait s’il m’arrivait d’exprimer l’opinion que la lune allait changer le temps. 

Mon p4re m’a raconto aussi, que c’4tait ma grand’more, Ag4e de 13 ou 14 ans, qui avait ddtermind 
Torthographe du nom. Lorsque la nouvelle que le roi Louis XVIII avait noramd Laplace marquis 
arriva 4 Arcueil, Laplace etait encore au lit, et sa petite-fille on allant lui dire bonjour, requt de lui cette 
demande, “Voila, qu’ils m’ont nomme marquis, Ang41ique, comment va-t-il falloir que je signe?”— 
“ MaiH bon-papa, e’est tout simple, marquis de Laplace.” 

J’ai pu sauver de l’incendie une vieille moutre divis4e en 10 heures—et le chronomdtro de Borda— 
de mon [arri4re-arri4re] grand-p4re. 

Je m’excuso, Monsieur, d’avoir mis plus de deux mois, 4 dorire cette lettre, mais j’ai dA voyager. 
Je ne recommence m6tne pas oette lettre, dcrite en rappelant des souvenirs d4j& vieux et qui manque de 
la forme et de la concision que j’aurais voulu lui donner. On peut trouver (je n’ai plus les titres 
exacts) des renseignements sur Laplace dans des oeuvres de MM. Biot et Arago—ce dernier s’est montrd 
un peu dur pour le vieillard qui l’avait accueilli—-et que M. Andoyer a 6crit un petit livre sur Laplace. 
Je vous envoie le discours de M. E. Picard. 

Veuillez, Monsieur, avec toutes mes excuses, agr6er l’expression de ma consideration la plus 
diBtingu£e. A. de Oolbert-Laplace. 

N.B. Dans ma biblioth&que de Mailloc sc trouvaient le masque de Newton et, au-dessus, l’extrait 
d’une lettre de M. Davy r6pondant a l’envoi par Laplace de Bes oeuvres. Il y etait dit “ qu’elles avaient 
ete placees sur la memo planche que celles de Newton, ne pouvant leur faire un plus grand honneor.” 

[I have not discovered any work of Arago on Laplace beyond his Report , which speaks in the highest 
terms of Laplace’s researches. After a vain search among the Paris booksellers for Andoyer’s work, 
1 wrote to inquire directly of him, but the illness followed by the most regrettable death of that savant 
has hindered any reply. K.P.] 
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Allgemeine Naturgeschichte und Theorie des Himmels, 1755* Yet I reminded 
my audience that the Mteanique celeste did not profess to be an original memoir, 
but a gigantic treatise on the mechanics of the universe, in which the author not 
only included all that was already known in his own day, but an immense amount 
of new matter. Is it, I continued, more reasonable to blame Laplace than to blame 
Euclid, many of whose propositions must have been known long before his day? 
Few of the treatise writers even of last century were like D. F. Gregory or E. J. 
Routh, and took the trouble to assign to their original discoverers the results they 
made use of I In this respect we may indeed approve the words of Gregory: “It has 
always appeared to me that we sacrifice many of the advantages and more of the 
pleasures of studying any science by omitting all reference to the history of its 
progress.” How many English text-books of Algebra have been writton in which 
no reference is made to Newton as the discoverer of the Binomial Theorem, or to 
Halley who first stated the Exponential Theorem ? 1 do not desire wholly to excuse 
Laplace; he was distinctly worse in this respect than Lagrange, but not worse than 
the bulk of French writers even up to the present date. Nor can we suppose that 
by omitting authors' names Laplace intended to claim their results as his own. 
The audience for which his work was intended was really the very one that knew 
well what Lagrange, Euler, Clairaut and others had accomplished. Laplace, like 
Ptolemy, set about writing his Almagest , a great work that should embrace all that 
was known about the heavens in its authors day. So much, and so much especially 
of importance was original, that readers of a later day are apt to consider it all 
original, and express anger when they find it is not so. We expect a big man in 
mind to be a big mail in heart, and it is a pity that Laplace was not more careful 
to be generous f to his compeers, marking off his own from other mens contributions 
to the mechanics of the planetary system; he would have lost no fame, by doing so. 
He certainly did not set out to steal, but he followed the usage—if a bad one—of 
his nation. His treatment of his own countryman Legendre and of our English 
Thomas Young shows that he was far from careful not to wound the susceptibilities 
of men who had made very real contributions to knowledge. 

The remarks which the Mdcanique celeste calls forth apply as strongly, if not 
more so, to the Thiorie analytique des Probability. Laplace put together all that 
was known of the subject in his day, and immensely added to and developed his 
material. But only those intimately acquainted with what Montmort, De Moivre, 
the Bernoullis, Condorcet and Lagrange had achieved, can fully grasp how much he 
owed to them not only for fundamental principles but for suggestions for further 
research. 

The second matter wherein Laplace has been severely criticised is in relation 
to political affairs. Thus we find it asserted that overmastering vanity led him to 

* The result has been that the Nebular Hypothesis is almost universaUy attributed to Laplace, 
although actually he only restated and developed it. 

f For the real generosity of Laplace towards his friends and pupils the reader must turn to the 
memoirs of Biot and Poisson, and also to the graceful picture of Madame de Laplaoe and her husband 
as hostess and host to both students and savants at Aroueil. 
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seek office, and “souplesse”—a flexible obsequiousness—and to praise each successive 
French ruler. Here again we must judge neither hurriedly nor too harshly. Let 
us examine matters historically, and call as well for evidence. Let us note in the 
first place the words of Robespierre, namely that with the new order in France there 
was no need for men of science; let us remember that the new rulers had suppressed 
the Academie des Sciences , had condemned Condorcet to death, and had sent 
Lavoisier, Laplace’s friend, to the guillotine; let us recall that Lagrange, Berthollet, 
and possibly even Laplace, were allowed to survive because they were useful to the 
Republic for the purposes of its munition factories and artillery schools. Let us 
consider the corresponding condition of science under Soviet rule in Russia today, 
where the Government is “purging” the St Petersburg Academy by ejecting 
obnoxious Academicians and by forcing for political reasons, not solely for their 
scientific reputation, its own nominees on that institution. 

Laplace came young to Paris, and like the much older Condorcet himself may 
well have thought that great results would flow from the Revolution; many men— 
politically wiser than Laplace—also made that error. As soon as Laplace reached 
Paris he developed a definite purpose for his life, namely to compile his Mechanics 
of the Heavens; his fame grew rapidly with the years, and he became, outside 
France, one of the most famous Frenchmen of his day; to fulfil his mission he had 
to regard governments, and in a somewhat different sense they in turn had to 
regard him. He was a national asset, which could be destroyed as Lavoisier was 
destroyed or else must be used and honoured. I find nothing remarkable in suc¬ 
cessive French governments conferring posts and dignities upon him. The less so, 
as there is a closer link between science and the state in France than elsewhere, 
and many of his dignities followed a well-established routine. Let us take 
an illustration which has been cited as an instance of time-serving. Laplaces 
Exposition dusyst&me du monde was published in 1796, four years after the founda¬ 
tion of the Republic; it was written through the days of the reign of terror, and 
though the revolution was running to its end Laplace could not foresee that end. 
Laplace dedicated his work to the Council of the Five Hundred and it concluded 
with the sentences* 

“Let us preserve with care, let us ever increase our stock .of this mighty 
knowledge, the delight of thinking beings. It has rendered important services to 
agriculture, to navigation and to geography. But the greatest benefit of the 
Astronomical Sciences is the dissipation of errors bom from man’s ignorance of his 
true relations to nature, errors the more grave in that the social order ought to 
rest entirely on these relations. Truth and Justice we find are their immutable 
foundations. Let us put far from us the dangerous maxim that sometimes it is useful 
to deceive or enslave mankind the better to ensure its happiness. Fatal experience 
has shown in all ages that these sacred laws are never infringed with impunity.” 

I admit that it is not clear how Laplace passes from the physical side of the 
universe to the moral side in social matters; there may indeed be a small element 
of rodomontade in the later sentences, but that they should be interpreted as the 
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words of a time-server seems to me a gross exaggeration of the facts. Laplace was 
at this time organising the Nicole polytechnique and presiding over the commission 
to report to the Council of the Five Hundred on the progress of science (1796). 
Twenty-eight years later when Laplace in 1824, an old man, is accused of suppress* 
ing this peroration, he wrote as follows: 

“Let us preserve with care, let us ever increase our stock of this mighty 
knowledge, the delight of thinking beings. It has rendered important services to 
navigation and geography *, but its greatest benefit has been the scattering of the 
fears that flow from the phenomena of the heavens, the destruction of errors which 
flow from our ignorance of our true relations to nature; errors and fears, which 
will promptly be reborn, if the torch of science should over be again extinguished.” 

I think it unreasonable to assert any connection between the marquisate con- . 
ferred on Laplace and this change of peroration. In the course of 28 years Laplace 
may well have learnt that the vague use—without definition—of such terms as 
“truth” and “justice,” terms ever-changing with the atmosphere man attaches to 
them—was not philosophic, and to describe these terms as “immutable bases,” when 
their interpretations are modified by every change in public opinion, was hardly 
worthy of a great man of science. But the whole accusation of time-serving seems 
to disappear, when we examine the editions of the Exposition , which appeared 
between 1796 and 1824. In the edition of 1799, Truth and Justice are cited no longer 
as “foundations” but as “immutable laws.” In the third edition of 1808, although 
Napoleon is now in command, the word Humanity is added to Truth and Justice 
as one of the “immutable laws.” Thus these words can hardly be held as a catering 
for the applause of the Revolutionists! In the fourth edition of 1813, all these 
words have disappeared; thus their disappearance can have nothing to do with the 
later marquisate (1817). The concluding words of his Exposition were clearly a 
great trouble to Laplace and he altered them with each edition. 

We may now pass to other points in his political career. Napoleon, who had 
been educated at the Brienne Military School, was in 1784 transferred to the licole 
militaire in Paris, and doubtless came in touch with Laplace, who had been appointed 
professor in that school in 1773. Further Napoleon received his commission as a 
sub-lieutenant of the artillery in 1786,Laplace having been appointed “examinateur” 
of the scholars of the royal artillery corps in 1785. Remembering that Napoleon 
prided himself on his mathematical knowledge, it is highly probable that he felt 
attraction and admiration for Laplace. When Napoleon came to be Consul and 
ultimately Emperor, his theory of government was that of a highly organised military 
state; the people should have equality and justice, indeed all that was good for them, 
but forced on them from above, not a product of their own activities. In order to 
carry out this idea of a highly organised state Napoleon chose men not by their 
birth, but by their ability, and made them servants of the state. Thus he appointed 
David state painter; to Cuvier and Laplace he delegated the reconstruction of the 
scientific and educational institutions of France. He chose what he held to be the 

* Note agriculture has disappeared! Had Laplace recognised that it would be a long time before 
meteorology could be reduoed to a true science ? 
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ablest men, and if his judgment failed occasionally it did not often fail, but one of 
those* failures was his belief that a great mathematician would be ipso facto a 
great administrator. One biographer tells us without citing tiny authority that 
Laplace repeatedly begged Napoleon to make him Minister of the Interior. 
Agnes Clark with no authority writes: 

“But merely scientific distinctions by no means satisfied his ambition. He 
aspired to the role of a politician, and left, a memorable example of genius degraded 
to servility for the sake of a riband and title. The ardour of his republican 
principles gave place, after the 18th Brumaire [179(1], to devotion towards the first 
Consul, a sentiment promptly rewarded with the post of minister of the interior.’' 
( Encycl . Brit Vol. Xiv. p. 302.) Rouse Ball writes: “It would have been well for 
Laplace’s reputation if he had been content with his scientific work, but above all 
things he coveted a decoration. The skill and rapidity with which he managed to 
change his politics as occasion required would be amusing if they had not been so 
servile. As Napoleon’s power increased Laplace abandoned his republican principles 
(which had themselves gone through numerous changes, since they had faithfully 
reflected the opinions of the party in power) and begged the First Consul to give 
him the post of minister of the interior. Napoleon who desired the support of men 
of science accepted the offer V* Such writing of history without a single reference 
to sources is pitiable. As far as I am aware Laplace before, during and after the 
Revolution never expressed anywhere his political opinions. How then did Rouse 
Ball discover that they had gone “through numerous changes’* and “faithfully 
reflected the opinions of the-party in power”? Such Statements published by a 
writer of one nation about one of the most distinguished men of a second nation, 
and wholly unsubstantiated by references, are in every way deplorable. When*, did 
Rouse Ball’s information come from ? i-believe it to be merely an exaggeration of 
a catch-penny character from the account of Agnes Clark, one of the most 
superficial writers that ever obscured the history of science. At the end of her 
account Miss Clark gives as her authorities the Eloge of Fourier, the Funeral Oration 
of Poisson, and the Report of Arago to the French Government on the works of 
Laplace. I know all these writings well, there is not a single word in them that 
justifies her defamation of*Laplace! Miss Clark says that notices of Laplace’s life are 
scanty, so they are, but that in itself is no rehson for allowing a too facile pen to 
give full freedom to an inventive imagination! The Eloge spoken by M. do Pastorct 
in the Chamber of Peers on April 2, 1827 refers to Laplace’s political career; it 
has apparently-been overlooked by both Agnes Clark and Rouse Ball. It speaks well 
of Laplace’s procedure both in the Senate and Chamber of Peers, it states that Kis 
speeches on the budget, on criminal instruction, and on the export of grain were* 
always clcarandilluininating. Where again, I ask,did these smaller.English historians 
draw their characterisation of Laplace as a time-server and a futile politician ? 

Probably, I believe, from an article on Laplace by Augustus De Morgan in the 
Penny Encyclopaedia of 1835. That distinguished mathematician had a fatal bent 

* A Short Hiitory.of Mathematics, 1888, p. 890. 
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towards damaging the scientific and moral reputations of greater mathematicians. 
I need only cite his treatment of both Newton and Laplace. 

De Morgan says that Laplace voted in 1814 for the deposition of Napoleon. It 
is indeed difficult to see what else any wise and patriotic Frenchman could do. 
Even De Morgan fully admits this, but he says that the suppression of the dedication 
to Napoleon of the Thiorie analytique des Probability (which appeared in the first 
or 1812 edition of that work), when the second appeared in 1814 after the deposition 
of Napoleon, is primd facie evidence of ingratitude and cowardice. 

Now let us read the dedication carefully. It runs: 

Sire, The kindness with which your Majesty has deigned to receive the hom&ge of my 
Treatise on the Mechanics of the Heavens has inspired me with the desire of dedicating to you this 
work on the Calculus of Probabilities. This delicate calculus applies to the most important 
questions in life, which are for the most part only problems in prohvbility. It ought for this 
reason to interest your Majesty, whose genius knows so well how to appreciate and worthily 
encourage all that can contribute to the public illumination and prosjasrity. I venture to ask 
you to agree to this new homage dictated by the most keen gratitude, and profound sentiments 
of admiration and raqKJCt, with which I am, Sire, the very humble and very obediont servant 
and faithful subject of your Majesty, Laplace. 

However we may judge of Laplaces original rendering unto Caesar of that 
which is Caesar's, it is perfectly clear that no publisher in 1814 could be found, 
or if found would have been permitted, to reprint in Paris in the year of the 
Emperor s deposition that dedication 1 What Laplace’s real wishes may have been 
we do not know, but whether he wished to reprint it or not, the Censor would 
most certainly not have permitted its republication. More than once in the course 
of his career De Morgan has erred in his judgments, because he failed to grasp 
that we cannot estimate the worth of a man without an understanding study 
of his environment. It is the more remarkable in this case because Dc Morgan 
himself takes a sound view of Laplace’s religious opinions: 

It is sometimos stated by English writers that Laplace was an atheist. We have attentively 
examined every passage which has been brought in proof of this assertion, and wo can find 
nothing which makes either for or against such a supposition. It is easy, with an hypothesis, to 
interpret passages of an author; but we are quite convinced that a person reading Laplace for 
philosophical information would meet with nothing which could either raise or solve a question 
as to the writer’s opinions on the fundamental point of natural religion, unless it had been put 
into his head to look. If those who make the assertion have any private grounds for it they 
should produce their evidence; but the assertion, whether considered with referonoe to the 
individual, or to the public before which it is made, should not be hazarded merely because 
a writer who is investigating such points ag oan be determined by experiment and analysis does 
not introduce his opinions on a question which cannot be submitted to calculation. An attempt 
to explain how the solar system might possibly have arisen from the cooling of a mass of fluid or 
vapour is called atheistioal, because it attempts to ascend one step in the chain of causes; the 
Principia of Newton was designated by the same term, and for a similar reason. What Laplace’s 
opinions were we do not know; and it is not fair that a writer who, at a time of perfect license on 
such matters, has studiously avoided entering on the subject, should bo stated as of one opinion 
or the other, upon the authority of a few passages of which it can only be said (as it could equally 
be said of most mathematical works) that they might have been written by a person of any 
religious or political sentiments whatever. 
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If De Morgan had applied his own statement as to expression of religious 
opinions to a consideration of Laplace’s political views for which it certainly holds 
as fully, then undoubtedly he would not have given rise by the tone of his article 
to such writings as those of Agnes Clark and Rouse Ball who assert that Laplace 
changed his views with every change of government in France. Laplace has 
nowhere expressed any political views whatever, for the dedication of a book to the 
head of a state cannot be looked upon as an expression of political sentiments. 
The permission to dedicate a new and important work to the sovereign was, in 
that day, equivalent to the statement that the book was approved by the state 
and it thus formed a much desired and excellent publisher’s advertisement. 

That Laplace failed as Minister of the Interior may be quite true and not in 
the least to Laplace’s discredit. What was needed in 1799 was a firm hand, and a 
head which would not consider minor points of justice or duty, but act promptly 
and forcibly. It needed a soldier, and one who would take his orders from Napoleon 
as from his commanding officer. That was the essential reason why Napoleon after 
six weeks’ experience replaced Laplace by his brother Lucien Buonaparte. In times 
of turmoil, when the success of the Consulate hung in the balance, Laplace was no 
more fitted than Condorcet to take a leading part. We may consider that it was 
a bad mistake for Laplace to accept the office, but there is no ground to suppose 
that he pressed (“repeatedly begged”) Napoleon to give him the post. 

Years afterwards* at St Helena ( Mdmoires de Sainte Helene ) Napoleon thus 
described the incident: ‘‘Mathematician of the highest rank, Laplace was not long 
in showing himself an extremely poor administrator. From his first actions I realised 
that I had deceived myself. He sought everywhere for subtleties, had only 
problematic ideas, and carried the spirit of the ‘indefinitely small’ into adminis¬ 
tration.” Yet if Napoleon had failed in his judgment of Laplace its an administrator, 
it did not modify his admiration of him as a mathematician. In 1802 Laplace 
dedicated to General Buonaparte the first volume of the Mecanique celeste , and 
Napoleon in thanking him spoke as a mathematician when he said that the first 
six months of freedom he had he should devote to reading the splendid work. After 
reading some chapters he wrote again regretting that circumstances had forced him 
into a career so far from that of science. Again three years later in 1805 whon in 
Milan he wrote : “La Mecanique cdleste appears to me destined to give new fame to 
the century in which we live.” Lastly in 1812 when the Thdorie analytique des 
Probabilitds reached him at Witepsk on his march to Moscow, he wrote: “There 
was a time when I should have read with interest your Treatise on the Calculus of 
Probabilities. Today I must limit myself to expressing the satisfaction which I feel 
whenever I see you producing new works which render more perfect and advance 
further the first of the sciences, thus contributing to the lustre of our nation. The 
advancement and the perfecting of mathematics are bound up with the prosperity 
of the state.” 

Bouse Ball calls it 41 Napoleon’s memorandum on the subject,” as if it had been given at the time 
as a reason for dismissal, loe, cit. p. 890. 
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I do not think these were merely formal and therefore idle expressions of thanks; 
they were evidence that he felt—after himself—Laplace to be in the eyes of Europe 
the chief honour of his nation. One other anecdote about the Mdcanique celeste has 
survived. Napoleon meeting Laplace said to him: “M. Laplace, they tell me you 
have written this large volume on the system of the universe without ever 
mentioning its Creator.” Laplace drew himself up and said: “I had no need of 
such an hypothesis.” This does not sound like a time-server! Napoleon mentioned 
the incident to Lagrange, and the the latter exclaimed: “Ah, it is a beautiful 
hypothesis, because it reaches so far.” 

In this respect another saying of Laplace’s may be cited, far less boastful than 
it appears: “Give me matter and I will create the universe.” Laplace saw that 
adequate knowledge of an element of matter would like Tennyson’s full under¬ 
standing of the flower in the crannied wall explain all in all. Laplace’s dying words 
show that he was no vain boaster, and how the words “Give me matter” are to be 
interpreted; those around his bed were recalling to him the great discoveries he 
had made in life, Laplace replied: “What we know is but a little thing; what we 
are ignorant of is immense.” 

The last days of Laplace have been briefly described by Fourier who writes: 

He had contracted the habit of excessive application so harmful to health, so necessary when 
studies are profound; nevertheless he did not experience any eufeeblement until the last two 
years of his life. At the commencement of the illness to which he succumbed, an instant of 
delirium was observed with fear. The sciences still occupied his mind. He spoke with un¬ 
accustomed fire of the movements of the stars, and afterwards of a physical experiment which ho 
said was crucial, announcing to those ho believed to bo present that he would soon make a com¬ 
munication to the Academy on these problems. His strength diminished more and more. His 
doctor [the famous physiologist Magendio], who by his talents and by the care which friendship 
inspired in him, merited Laplace’s entire confidence, watched by his bedside. M. Bouvard, his 
collaborator and his friend, never for a moment quitted him. 

“Surrounded by a beloved family, under the eyes of a wife whose tenderness 
had aided him to support the trials inseparable from life, whose amenity and grace 
had shown him the worth of domestic happiness”—the great mind parted from its 
mortal frame, and according to Fourier “returned to the heavens”—perhaps the most 
fitting place for the genius who had timed the courses of the stars in. their paths. 

Personally I prefer to quote the lines with which Virgil opens the second book 
of his OeorgicSy lines which Laplace so highly appreciated that he placed them at 
the head of his Micanique celeste : 

Ye muses, beloved beyond all else, whose sacred emblems I bear, penetrated as I am by 
ardent love, take me to yourselves, and show me the pathways of heaven, and the stars that 
traverse them*. 

This the Muses accomplished for Laplace, and by their grace his genius may 
live immortal, even as Calypso, the fair nymph, promised immortality to Odysseus 
after his toils, should he be content to remain faithfully with her. 

# Me voro primum, dulces ante omnia Musae, 

Quarurn saora fero, ingenti perculsus amore, 

Aecipiant, coelique vias ao aider a monstrant. Georgia , u, 11. 475-7. 
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Mrs Somervilles final summary of the Micanique celeste may be cited here, 
it still remains true ( Mechanism of the Heavens, 1831, p. 2) : 

Tables of the motions of the planets, by which thoir places may be determined at any instant 
for thousands of ycais, are computed from the analytical formulae of La Place. In a research 
so profound and complicated, the most abstruse analysis is required, the higher branches of 
mathematical science are employed from the first, and approximations are made to the most 
intricate senes. Eiisier methods and more convergent series may be discovered in process of time, 
which will supersede those now in use; but the work of La Place, regarded as embodying the 
results of not only his own researches, but those* of so many of his illustrious predecessors and 
contemporaries, must ever remain, as he himself expressed it to the writer of these pages [Mary 
Somerville], a monument to the genius of the age in which it appeared. 

It is the Almagest of a century ago, and Laplace’s own description of it you will 
note is far from claiming all its results as the product of his own brain. 

One word, I think, may be safely added about a strange incident connected with 
part of Laplaces mortal remains, namely his brain. Magendie, his physician, must 
have held an autopsy and removed in the course of it Laplace’s brain. No report 
that I can hear of was ever published about the results of the autopsy. But about 
fifteen months after Laplace’s death, on June 16, 1828, a paper was read by 
Magendie before the Academic des Sciences; it is entitled “ Memoirc physiologique 
sur le cerveau." The theme of this memoir is that the less cerebro-spinal fluid in 
the brain the greater is the intelligence. In other words the less fluid the more 
thinking matter. Towards the end of this memoir occurs the following paragraph: 

I once found myself under the sad necessity of examining the brain of a man of genius, who 
died at an advanced age [ 78 ], but when still enjoying the fulness pf his intellectual faculties. The 
sum of cerebro-spinal fluid was not more than two ounces, and the cavities of the brain contained 
at most a dram. ' 

There is no mention of Laplaces name, and although I have searched French 
literature I can find no further details of either the autopsy or the brain of Laplace. 
Here the matter might have rested had not Miss M. Tildesley in 1927 brought me 
a remarkable letter with which she had been entrusted by Miss Helen Hunter 
Baillie whose name indicates her relationship to Mrs Joanna Baillie and to her two 
brothers William and John Hunter, famous authoress and famous anatomists. 

The letter dated only “Hampstead, Monday, 1834” is from Joanna Baillie to her 
great-niece Miss Sophy Milligan, and contains the following important paragraph: 

My dear Sophy...Dr Somerville told us not long ago a whimsical circumstance regarding the 
head of La Place tho famous French Astronomer. Some Ladies and Gentlemen went one day to 
the house of Majendie [«<?/] the great anatomist to see the brains of this Philosopher, which they 
conjectured must be of a very ample size, and seeing a preparation on the table answering tlieir 
expectation they were quite delighted. “ Ah I see what a superb brain, what organs, what 
developments! This accounts completely for all the astonishing power of his intellect, etc.” 
Majendie, who was behind them and overheard all, stepj>ed quietly forward and said: “ Yes, that 
is indeed a large brain, but it belonged to a poor idiot, who when alive scarcely knew his right 
hand from his left. This, Ladies and Gentlemen ” (handing to them a preparation of a remafkably 
small brain), “this is the brain of Laplace/ Dr Somerville was told this anecdote by Majendie 
himself..Your Affectionate Aunt, J. Baillie. 
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“Dr Somerville” can scarcely be other than the physician, fellow of the Royal 
Society, and husband of Mary Somerville, the learned lady who studied Newton's 
Prinaipia in the original, was the friend and correspondent of Laplaoe, and 
paraphrased in her Mechanism of the Heavens his Mtcanique celeste. There is 
accordingly no doubt that Magendie was in possession of Laplace’s brain 6 or 7 
years after his death, and that this is the brain to which he referred in his “M6moire 
physiologique sur le cerveau,” probably written in the year of Laplace’s death* 

I have tried in vain to ascertain in Paris what became of Magendie’s collections 
when he died in 1855. Probably they were sold like his books. Such is the second 
chapter in the history of Laplace’s brain. I now turn to the third stage in this 
history. I published Joanna Baillie’s letter in Nature asking if any one knew what 
had become of Laplace’s brain. I received a strange answer. Let me digress for a 
moment. I well remember as a boy, perhaps I was 10 to 12 years old, a mysterious 
Museum near the foot of Regent Street, I think near Glasshouse Street, an 
Anatomical Museum; if I remember rightly, it had a skeleton or the model of a 
tailed man in the window. Such things had a certain fascination for me, but the 
mystery of the place was increased by a strict injunction from my parents never to 
go inside. Under the circumstances, I think, most boys would certainly have gone 
inside. I, as fortune would have it, did not. I don’t think it was because I was a 
good boy, because I was not; but rather because I was a coward and, although 
curious, had not the courage to face the contents of that Museum. A different 
type of parent and a different boy led to the discovery I am about to communicate 
to you. I received a letter from Mr A. B. Bence-Jones dated June 16, 1927 from 
11 King’s Bench Walk, Temple: 

Dear Sir, The Brain 0 y Lap i ace . 

I am much interested in your letter printed in Nature , 16 April 1927 
on this subject, but I have nothing to suggest except very indirect evidence. As a 
boy I recall a visit with my Father (who died in 1873) to Kahn’s or Klihn’s Museum 
in Glasshouse Street, Regent Street, and my attention was directed to a glass vessel 
said to contain the brainof Laplace, but I cannot recallany mention of Magendie’s name. 

I do not know if any record or catalogue of this Anatomical Museum exists. 
Probably some one at the London Museum, Lancaster House, S.W. 1 would know, 
and I regret that I cannot make inquiry there, just now. 

I need only add that my Father was Dr Henry Bence-Jones, F.R.S., and was 
Secretary of the R. Institution and author of Faraday’s Life and Letters . 

I am, dear Sir, Yours* faithfully, 

A. B. Bence-Jones. 

In a second letter Mr Bence-Jones says that he must insist on Kahn’s Museum 
being in Glasshouse Street: “My one visit to Glasshouse Street impressed me much 
and in days not later than the seventies, I often observed the place, and shuddered 
at my recollection of its nature. It was revolting to a boy.” 

Now Magendie died in 1855. Professor C. Richet kindly tells me that Magendie’s 



214 Laplace 

books were sold, and that he has at the present time certain of them. It seems 
probable therefore that his collections with their preparations were also sold. Now 
how did Laplace’s brain come into Kahn’s Museum ? Joseph Kahn was a doctor of 
medicine of the University of Vienna—he set up an anatomical museum and at 
first appears to have moved the collection up and down England, exhibiting both 
in Newcastle and London. He started apparently in this country about 1851, and 
catalogues of the contents of his museum were published in London, 1851, Ncwcastle- 
upon-Tyne, 1852, London, 1853 and later dates. No catalogue appears to have been 
published after the date 1855 or 1856 at which we may suppose Magendie’s 
collections to have been purchased. 1 see no reason to believe that originally 
Kahn’s M useum may not have been what it professed to be —a museum for the study 
of anatomy—but its proprietor soon found that the shillings rolled in from an 
inquisitive lay public, and accordingly I)r Kahn started introducing monstrosities 
of all types approaching near to those of the showman at village fairs. Kahn’s 
residence at one time was 17 Harley Street and one may suppose he then desired 
to build up a consultant practice. The Museum was at 315 Oxford Street in 1851, 
in Coventry Street, Leicester Square in 1856, in 1864 at 3 Tichborne Street, 
Haymarket, where apparently Joseph Kahn resided. In the following years 1865, 
1866 it goes on under George Kahn, M.D. at the same address. Probably George 
was a son of Joseph. In the Directory for 1867 there is no occupant given for the 
house No. 3 Tichborne Street. The Kahns and their museum seem to be wanting 
in the later Directories . The ultimate source of this disappearance was probably 
due to the medical journal—the Lancet —which on June 3, 1865 (p. 600) published 
an article inveighing against “anatomical museums” and calling for measures to be 
taken for their suppression. The matter attracted considerable public attention, but 
nothing appears to have been done till the police at the instance of the Society for 
the Suppression of Vice in March 1873 seized Kahn’s anatomical models, and on 
application by the Solicitor of the Society these models were destroyed at the 
Marlborough Street Police Court on Dec. 18, 1873. The Solicitor for the Society 
said that the museum was closed and the models not seized by the police packed 
up to be sent abroad. Kahn himself appears to have been tried on January 2, 1874, 
but I am unaware of what was the result. 

It seems highly improbable that the police would remove exhibits like the brain 
of Laplace, which under no circumstances could be interpreted as conducive to 
immorality, and there must have been many similar exhibits. Assuming as seems 
highly probable that Kahn purchased Magendie’s preparations then the appearance 
of Laplace’s brain in Glasshouse Street becomes explicable. Granted this, it would 
be of great interest to ascertain the present locus of the remainder of Kahn’s 
collection. Probably finding London no longer profitable Kahn retreated to his 
native land or to Germany, taking with him his unseized models, his tailed men, and 
the still unstudied brain of Laplace. I have not hitherto succeeded in tracing Kahn 
or his collections. Dr G. M. Morant kindly communicated to me a curious point; he 
told me that when he was in Munich a few years back advertisements were posted 
about the streets, announcing the arrival and sojourn for a time in Munich of a 
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great show—an anatomical museum. I wonder if this was Kahn’s original collection, 
and if so whether it still contains the brain of Laplace—the brain of possibly 
the greatest mathematician of the ages travelling about the continent in a 
showman’s van! 

Imperial Caesar, dead and turned to clay, 

May stop a hole to keep the wind away. 

The rest of Laplace’s mortal remains were buried at Paris in the P&re Lachaise 
cemetery. There they remained for 61 years until 1888, when they were exhumed 
in fulfilment of the desire of his son General de Laplace (who died in 1874 at the 
age of 84) and taken to the family estate of Saint Julien de Mailloc, a small hamlet 
between Lisieux and Orbec in Calvados. On the bye road to these places is a 
Greek temple with a bronze urn containing the heart of Laplace, and inscriptions 
commemorating the birth and death of Laplace and the dates of publication of his 
chief works*. Other members of the family are buried in this temple. The 
monument from Pore Lachaise was given by the Laplace family at the same time 
to the commune of Beaumont-en-Auge, where it was re-erected in the cemetery. 

In June 1871 the Laplace sanctuary at Arcueil which had escaped the Prussians 
was raided by a band of ruffians from the Mouffetard district. The manuscripts of 
the great mathematician were thrown into the river Bievre, from which that of 
the Mtcanique celeste was subsequently fished out. The library which was rich in 
rare books, souvenirs and works of art was looted and devastated (see Nature , Yol. iv. 
p. 108), a sorry ending to Madame Laplace’s piety f! Probably owing to this 
occurrence the remaining personal relics and papers of Laplace were transferred to 
Saint Julien de Mailloc, but the family chateau at de Mailloc was completely 
destroyed by a fire on Dec. 11,1925, and thus perished the whole of the Laplaciana. 
I must confess that I feel personally some pleasure in seeing and handling the relics of 
the great men of our earth. It may be a silly morbid pleasure akin to the veneration 
some practise for the bones of saints. Still I could have wished the Laplaciana 
preserved at Arcueil, as one might have hoped that the Newtoniana could have 
been collected at Woolsthorpe, like the Galtoniana in this building J. The books, 
papers and personal relics of Laplace were destroyed by fire; portions of the library 
of Newton, which would have told us so much of the writers who had helped to form 
his mind, and •which had been for two centuries preserved unknown to the world, 
were privately sold only the year before last, and the books with the book plate 
“Philosophemur” and with Newton’s autographs and notes appeared unexpectedly 
in half-a-dozen different booksellers catalogues to be dispersed over the world into 
the collections of the wealthy, who had money to buy them ; the remainder is now 
on sale in the hands of a London bookseller. Perhaps on the whole the history of 
science would profit, if we still had some of the mediaeval veneration, if not for 

# Nature , April 2, 1927. 

t This has referenoe to Madame Laplace’s proposal to seU Aroueil (discussed in an unpublished part 
of these lectures) in order to provide funds for the republioation of her husband’s works, which were out 
of print and very scarce. The sale waB prevented by Arago’s Report to the French Government, which 
then undertook to issue a national edition of Laplace’s works. 

X The lectures were delivered in the Galton Laboratory of the University of London. 
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the bones of saints, at least for the papers and books which are the tools of genius. 
We know so little of the life, we know so little of the methods of research of 
Laplace, that a complete destruction of all his papers and relics, before any real 
life of him has been written, is indeed a lamentable loss. 

I have told you all that I have been able to gather of this great Frenchman’s 
life and character. The man who in the first quarter of the nineteenth century 
appeared as a giant among the intellects of that day, who to our own generation 
still stands out as one of the greatest mathematicians of all ages, lacks up to 
the present a critical biographer, who will give him true characterisation as man 
and as scientist. What I have put before you, however inadequately, may perhaps 
suffice to warn you against accepting too readily the statements— unauthenticated 
by documentary evidence—of minor writers on the history of mathematics. The 
history of science, after becoming an academic study, seems to have dropped (in 
the modern spirit) scholarly investigation for the methods of journalism. 
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Corrigendum, Biometrika, Vol. XXI, p. 160. 

Karl Pearson and C. H. Usher: “Albinism in Dogs.” 

In footnote nnder heading No. 3, line 2 for “Darkly pigmented epithelial layers 
of retina” read “Darkly pigmented epithelial layers of iris.” 



LE8 ORIGINES DE LAPLACE: SA G^N^ALOGIE — 

ses Etudes. , 

Pab L’ABBti G. a. SIMON. 

Lbs notices sur l’iUustre mathematician Laplace fourmillent. d’erreurs, en ce 
qui conceme ses origines et ses d6buts. On affirme qu’il appartenait it une famille 
pauvre, qu’il fjut form6 aux sciences math4matiques par son oncle Louis, qui 6tait 
prStre, qu’il flit 61bve de l’&jole militaire de Beaumont-en-Auge, etc. Autant 
d’assertions inexactes, que Ton va s’efforcer de rectifier & 1’aide des documents. 

§ I. GriNtfALOGIE DE LA FAMILLE DE LAPLACE. 

Nous avons dress6 cette genial ogie k l’aide des registres conserves dans les 
mairies de Bourgeauville, Criqueville, Angerville, Grangues, Beaumont-en-Auge 
et DozulA Nos renseignements sont done puis6s aux sources et nous foumissent 
des 616ments certains sur les h4rddit6s du savant et le milieu oil il a grandi. 

Le premier ascendant certain de Laplace est Olivier de Laplace, qui vivait il 
Bourgeauville* en 1645, et qui appartenait certaineraent k une famille notable, 
car nous le voyons, en cette m£me ann£e, 1645, agister k Bourgeauville, au manage 
de Pierre Lambert, 4cuyer, sieur de Saint-Mars, avec Ang61ique de Montgommeiy. 

D’ou venaient ces de Laplace ? Je l’ignore. Ils n’appartenaient pas pr£cis6ment 
& la noblesse, mais a cette aristocratic terrienne qui faisait presque figure de no¬ 
blesse dans nos paroisses rurales, et s’alliait souvent avec ell©. Il me semble assez 
vraisemblable que les de Laplace de Bourgeauville Staient de m6me origine que 
les de Laplace de Rouen, qui foumirent, des le xvi® si&cle, des Conseillers au 
Parlement de Rouen. J’ai remarqu6, au cours de recherches d(jja longues, sur les 
families du Pays d’Auge, que nombre de ces families se retrouvent it Rouen. 
Bourgeauville, membre de l’Rlection de Pont-l’fiveque, appartenait k la G^n4ralit£ 
de Rouen, et les rapports de commerce aussi bien que les rapports administratifs 
6taient frequents entre Rouen et le Pays d’Auge. 

La branche des Laplace de Rouen, la plus anciennement connue, portait: d’azur 
& 3 molettes d’or. Une autre branche rouennaise, celle des Laplace, sieurs de 
Fumechon, portait: d'azur a la molette (alias d, I’itoile) d’or surmontie d’un lambel 
d’or —ou encore: de mime avec I'itoile d’argent, ou bien: d’azur & 3 trifles d’or. 
Comme dans beaucoup de families bourgeoises, les armoiries ne sont pas tr&s fixes. 
Nous ne savons pas d’ailleurs si les de Laplace de Bourgeauville avaient conserve 
le souvenir d’armoiries de famille. 

Dans les actes le nom est dcrit: de Laplace, de la Place, Delaplace, Cette 
derni&re forme est plus rare avant la Revolution. C’est oependant celle qui a 
subsist^ dans les branches collat6rales de la famillef. 

* Calvados, canton de Doeull, airondissement de Pont-lAvtqne. 

t Nona avons adopts, poor plus d’unitd, la forme “de Laplace” poor les degrAs anWrieurs A la 
Revolution. 
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I. Olivier de Laplace* avait 6pous6 Anne Follebarbe , qui appartenait k une 
vieille famille du Pays d’Auge, toujours repr6sent6e. Ils moururent Tun et Tautre 
en 1680, 4 Bourgeauville. Olivier de Laplace est toujours qualify “Maitre,” ce qui 
suppose une situation notable. De ce mariage naquirent an moins six enfants: 

1° Mtre Francois de Laplace , qui suit. 

2° Jacques de Laplace , ne vers 1039, d£c£d£ a Bourgeauville, le* 8 octobre 
1728, a luge de 90 ans. II avait 6pous£ sa cousine: Jacqueline de Laplace, 
dont il eut au moins une fille: Marie, marine en 1705 k Charles Jourdain, 
dont elle etait veuve en 1730. 

3° Marie de Laplace , mariee en 1658 a Simon Le Coq. 

4° Charlotte de Laplace , baptis^e k Bourgeauville le 26 janvier 1642. 

5° Ay me de Laplace , baptise a Bourgeauville le 15 novembre 1645, et 
decide en 1706. 

6° Jacqueline de Laplace , baptis^e a Bourgeauville le 14 f&vrier 1649. 

II. Francois de Laplace , qualifie “Maitre,” comrne son pere, fut baptist h 
Bourgeauville le 5 avril 1638. II y epousa, en 1666, demoiselle Barbe Belot . II 
dut mourir k Beaumont-en-Augc+ fort ag^, car nous l’y trouvons en 1731. 

De ce mariage naquirent au moins quatre enfants: 

1° Mtre Olivier de Laplace , qui suit. 

2° Nicolas de Laplace , nd vers 1685, d£c6de k Beaumont le 16 aotlt 1735, 
et inhum<$ en presence de son fils: Jacques de Laplace . 

3° Martin de Laplace , mari6 a Bourgeauville en 1720 k Marie Lesnis , fille 
de feu Francis Lesnis et de Jeanne Le Pecq. Les Lesnis sont une ancienne 
famille du Pays d’Auge, qui s’est alliee notamment aux Ch6ron, sieurs du 
Fresney, et les Le Pecq ont donne naissance, k la fin du XVlii* si&cle, k Louis 
Le Pecq, sieur de la Cloture, c£l&bre m^decin, anobli par Louis XVI. 

4° Simon de Laplace , dont la branche sera indiqu6e dans une section 
sp^ciale. 


(A) Premilre branche . 

III. Maitre Olivier de Laplace , chirurgien royal, n6 vers 1669, et d6c6d6 a 
Bourgeauville en avril 1736, fig6 de 67 ans 18 jours. II fut inhum6 “dans r^glise,” 
privilege r6serv6 aux seigneurs et aux notables. II 6pousa: 1° Marguerite Gardin> 
2° Charlotte Br4ard , fille de Jacques Brdard et d’Anne Le Petit. Ce dernier 
mariage eut lieu h Bourgeauville le 12 mai 1704. Les Qardin paraissent origi- 
naires de S. ^tienneda-Thillayet. Ils 6taient allies aux Le Cordier et aux Isabel, 
riches families de la region §. Les Br6ard dtaient 6galement bien pos6s. 

* Pour plus de clart4, none avons 6crit en majusoules les nomB des anefitres directs du Bavant. 

t Canton et arrondissement de Pont-l’fiv6que (Calvados). 

t Canton et arrondissement de Pont-l’Ev&qne. 

§ H. Le Court: Qtin&alogie de la famille Le Cordier , seigneurs de MaloUel % p. 52. 
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Mtre Olivier de Laplace laissa de son premier manage: 

1° Jean de Laplace , qui suit. 

2° Marie-Anne de Laplace , baptis^e it Bourgeauville le 26 d6cembre 1697. 
Elle 6pousa, en cette meme paroisse, le 27 septcmbre 1723, Maitre Nicolas 
Le Carpentier t originaire de Criqucville*, fils de Jacques Le Carpenticr et de 
Catherine Bouet, qui devint Conseiller du Roi. Ils eurent pour enfants: 
(a) Claude-Fran^ois Le Carpentier, Conseiller du Roi, Maitre Particulier des 
Eaux et Forets, au baillage d’Auge, marie en 1767 k Marie-Anne-Catherine 
Cambremer, de la famille des Gambremer de Croismare . ( b) Jacques-Charles 
Le Carpentier, Conseiller du Roi, seigneur et patron de Putotf, lieutenant en 
Election de Pont-H-Cvoque, mari6 en 1758 fi Fran^oise-Catherine de laTaille. 
(c) Marie-Catherino Le Carpentier, marine en octobre 1754 k Marin Barbey, 
bourgeois de Caen, de la famille des Barbey de Longbois. 

3° Pierre-Daniel de Laplace , baptist a Bourgeauville le 13 novembre 1698. 

IV. Jean de Laplace 6pousa Anne Toutain . Ils habitaient Bourgeauville. Ils 
eurent au moins neuf enfants: 

1° Jean-Frangois de Laplace , baptist k Bourgeauville le 11 avril 1719. 

2° Jacques de Laplace , baptist k Bourgeauville le 9 novembre 1720. 

3° Robert-Olivier de Laplace , baptist k Bourgeauville le 14 mars 1722. 

4° Robert de Laplace , baptist k Bourgeauville le 17 aoftt 1723. II devint 
pretre et chapelain de Brucourt. 

5° Anne-Frangoise de Laplace , n6e en 1726. 

6° Robert-Jacques de Laplace , baptist k Bourgeauville le 14 mars 1727. 

7° Frangois de Laplace , qui suit. 

8° Marie*Anne de Laplace , baptis6e k Bourgeauville le 17 mars 1731, 
d6c6d6e le 17 juin 1735. 

9° Jean-Baptiste-Frangois de Laplace , baptise k Bourgeauville le 27 aoflt 
1732. 

V. Frcmgois de Laplace se fixa k GranguesJ, puis k Criqueville§, oil il fut 
inhum£ le 18 fSvrier 1777. II avait 6pous6 Marie-^lisabeth Morin , dont il eut 
au moins neuf enfants: 

1° Anne-Jeanne de Laplace , n6e k Grangues vers 1751, d6c6d6e en 1809 
k Angerville||, marine successivement it Jean de la Rue et it Henri Loriot 

2° Jacques de Laplace , present avec ses fr&res it Tinhumation de son beau- 
frfere Jean de la Rue, k Criqueville, le 20 octobre 1779. 

* Canton de Dozuld, arrondissement de Pont-l'JiWGque. 
t Canton de Dozuld, arrondissement de Pont-risque. 

X Canton de Dozul6, arrondissement de Pont-l*6v6que. 

§ Canton de Dozuld, arrondissement de Pont-l’Evdqne. 
it Canton de Dozul6, arrondissement de Pont-l’Ev£que. 
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3° Nicolas de Laplace , qui suit. 

4° Francois de Laplace, baptise k Criqueville le 3 septembre 1762. 

5° Marie-Ft'anfoisc de Laplace, baptis6e a Criqueville en 1764. 

6° Gather ine-Fiticiti-Perpetue de Laplace, baptisee h Criqueville le 26 no- 
vembro 1760. 

7° Jean-llaptiste de Laplace, baptise k Criqueville le 29 d6cembro 1772. 
II s etablit a Grangues, et epousa Marguerite-Fraupoise Marline. II est qualify 
“ proprietairc.” II eut pour enfants: (a) Desiree de Laplace, n£e a Grangues, 
nmriee h Baptiste Delaplace son parent, dont j’ignore l’origine. ( b) Pierre - 
Jean Frangois-Hyppolite de Laplace qui £pousa en premieres noces: Marie - 
Anne-Elisabeth Philippe, d6c£d£e le 5 novembre 1816; et en secondes noces, 
ii Dozule, le 24 novembre 1824, Rose-Julie Lelihre, n6e k Gerrots*, fille de 
Pierre Lelievre et de Marie-Anne Prentoutf. De cette seconde union naquirent 
trois enfants: Julie-Alinda , Jean-Desiri-Eugene, Henri-Edmond. La premiere 
et le dernier moururent jeunes. J’ignore la destin6e du second, n6 k Dozul6 
en 1830. 

8° Marguerite de Laplace, cit6e coramc t^moin dans divers actes. 

9° Franpoise-Ther&se-Adila/ide de Laplace, fille posthume,baptisee aCrique- 
ville le 17 mai 1777. 

VI. Nicolas de Laplace (alias Delaplace), 6tabli k Grangues, cite en divers actes 
avec ses freres, epousa Adelaide Marline, dont il eut deux fils: 

1° Nicolas (?) Delaplace, counu dans la famille sous le nom de “Taini 
Delaplace,” d6cdd6 a GoustranvilleJ. II n’eut qu’unc fille: Laurence Delaplace , 
mariee ii M. Sauvage , dont deux filles d£cdd6es sans post6rit6. 

2° Stanislas Delaplace, qui suit. 

VII. Stanislas-Casimir Delaplace, ne k Grangues le 10 mars 1796 et d6c£d£ k 
Dozule le 15 novembre 1875. II avait epous6 Marie-Anne-V6roniquo Leneveu , fille 
de Jacques-Fran^ois-Pierre Leneveu et de Marie-Anne-F£licit6 Leroy, d6c£d6e 
k Dozul6 le 15 d^cembre 1882, dont: 

1° Abilina Delaplace, n^e k Criqueville, marine ii Auguste Courage , dont 
post6rite§. 

2° Eughie Delaplace , qui suit. 

3° Edmond-Constant Delaplace, domicilii k Angerville, mari6 k C^leste- 
Emestine Lefbvre, dont: (a) Jeanne, morte jeune; ( b ) Fernand , mort jeune; 
(c) Lion, domicilii & Beaumont; (d) Juliette, marine k M. Dubois. 

4° Jules Delaplace, mari6 k N., dont: («) Jeanne, marine & Henri 6ven 
(post6rit6); (b) Henri ; (c) Andri, mort k 17 ans, k Pont-r6v5que, le l or avril 
1924. 

* Canton de Cambremer, arrondissement de Pont-l’Ev$que. 

t La famille Prentont est anoienne an Pays d’Auge. L’un de ses reprdsentants, M. Henri Prentout, 
est aujourd’hui professeur d’Histoire de Normandie. & la Faculty des Lettres de Caen. 

X Canton de Dozul6, arrondissement de Pont-l’Evdque. 

§ Cette posterity est repr6sent4e par Mmo Aimable Moulin et par M. Henri Couraye, maire de Dozule. 



L’Abb^ G. A. Simon * 221 

VIII. Eugkne-iZtienne- C asimir Delaplace, n6 & Criqueville, mari6 k Louise- 
Emma Gosse, dont: 

1° Jules Delaplace, mari6 k Ang61ine Dasseville , dont un fils, mort jeune. 

2° Georges Delaplace , qui suit. 

IX. Georges Delaplace , mairo do Leaupartie *, ddl6gu6 cantonal de Cambremer, 
officier du m6rite agricole, n6 k TrOarn, mari6 a Montreuilf, le 22 mai 1883, k 
Caroline-Ad61ai*de Goupil, fille d’Adolphe Goupil et d’Euphrasie Martin des Fon¬ 
taines . M. G. Delaplace habite actuellement le manoir de Leaupartie, II a un fils, 
qui suit: 

X. Maurice Delaplace , a Leaupartie le 21 juillct 1884, brigadier d'artillerie 
en 1915—1919, Croix de Guerre, mari£ & Montreuil, le 9 janvier 1925, k Madeleine 
Laval, fille de Pierre Laval ct de Mrne, nee Soulier, dont: 

1° Georgette Delaplace , n6o a Leaupartie le 18 juin 1926. 


(B) Seconde branche. 

III. Simon de Laplace , fils de Francois de Laplace et de Barbe Belot, in- 
diquds plus haut, s^tablit a Beaumont avant 1743. Eh 1738, nous le trouvons k 
Bourgeauville, parrain de Thomas-Fran^ois, son neveu, et le 22 d^cembre 1732 
il est parrain & Criqueville de sa petite niece Marie-Catherine Le Carpentier, fille 
de Nicolas, Conseiller du Roi, et d’Anne de Laplace. II avait 6pous6 Marie Viel , 
dont : 

1° Marie de Laplace , marine k Beaumont, le l er juillet 1743, k Maitre 
Robert Carrey , m6decin k Lisieux, paroisse St Germain. Les Carrey sont 
une trfes ancienne famille Lexovienne qui a donne des notaires royaux, des 
avocats, des m&Iecins. Le manoir Carrey est aujourd'hui Tune des plus cu- 
rieuses vieilles maisons de la ville. 

2° Louis de Laplace, pr5tre, chapelain de Criqueville, dont nous aurons 
Toccasion de parler. 

3° Pierre de Laplace , qui suit. 

4° Simon de Laplace, cite5 avec ses freree au manage de Robert Carrey et 
de Marie de Laplace. 

5° Thomas-Franfois de Laplace, chirurgien, d6c6d6 k Bourgeauville, a l’Age 
de 27 ans, en 1738, mari6 k Marguerite Hervieu, dont: Thomas-Franfois, 
baptist k Bourgeauville, le 24 avril 1738. 

6° Probablement: Genevieve , marine avant 1745 k Jacques Mahon. 

* Canton de Cambremer, arrondissement de Pont-riiJvdqqe. 
t Canton de Cambremer, arrondissement de Pont-l’J&vftque. 
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IV. Pierre de Laplace , demeurant k Beaumont, sindic de la paroisse, mari6 
a Tourgoville*, )e 6 juillet 1744, a Marie-Anne Sochon , fille de feu Louis-Roberfc 
Sochon ct de Marie-Anne Le Chevalier . De ce inariage sont n6s: 

1° P 1 ERRE-Sl MON 1)E L A PLACE, qui suit. 

2° Marie-Anne de Laplace , baptist a Beaumont le 15 juin 1745. 

V. Pierre-Simon , cointe, puis marquis de Laplace , n6 h Beaumont le 23 mars 
1749. C'est rillustre savant auquel scront consacr^es les pages qui vont suivre. 
Marie a Marie-Charlotte de Courty de Romanges (famille de Bcsan<;on), il en eut 
un fils et une fille: 

1° 67/ arles- $ mile P. marquis do La Place , n6 15 avril 1789, mort 
27 octobre 1874, general de division, s6nateur, Pair de France, Grand-Croix 
de la Legion d’honneur, chevalier de St Louis. 

2° Sopuie-Suzanne de Laplace , morte 1813, en suite des couches de sa 
fille, inari6e k Adolphe-Fraii(;ois-Ren6, marquis de Fortes, Pair de France, 
dec6d6 a Paris le 22 septembre 1852f. Us eurent une fille: Ang61ique- 
Jos£phine-Charlotte de Portes, mariee k Napoleon-Joseph-Auguste, Comte 
de Colbert-Chabannais, decede le l wr octobre 1883. Le second fils issu de ce 
mariage: Pierre-Louis-Jean-Baptiste, Comte de Colbert, releva le nom de 
Laplace, en vertu dun d^cret de 1876. 11 ^pousa en 1882 d® Uo Renault, dont 
post6rit6. Adolphe-Fran£ois-Ren6, marquis de Portes, dun 2 Iue mariage avec 
Caroline Hutton, am6ricaine, eut deux filles: 1° Catherine-Mery-Adolphine 
de Portes, mariee en juillet 1846 a Napoleon-Victor-Eugene, cointe de Bellune, 
dec6d£ en 1852, et ensecondes noces a Charles-Eugenc-Henry-Joseph-Texier, 
marquis d’Hautefeuille, et 2° Madame de Montgomery. 

§ II. Les Sochon, ancetres maternels de Laplace. 

Les Sochon £taienl primitivement originaires de Vauville, canton de Pont- 
Tlllveque. C’dtaicnt riches cultivateurs, dont la branche la plus connue est celle 
des Sochon de Lavigne, trfes proche parente de la mbre de Laplace. 

Nicolas Sochon habitait Vauville en 1669. II 6pousa Charlotte Congnet% f 
d’une famille de Tourg6ville, ce qui sans doute arnena cette branche k se fixer k 
Tourg^ville. 

L’un des fils de Nicolas, Antoine Sochon, 6pousa k Tourg^ville, le 16 juillet 
1669, Marguerite Cofin, dont il eut entre autres enfants: Robert-Louis Sochon. 

Robert-Louis 6tait n6 k Tourg6ville et y avait 6t6 baptist le 8 janvier 1687. 
11 6pousa Marie Le Chevalier, dont il eut deux filles. 

* Canton et arrondissement de Pont-l’lWque. 

t Pour remployer la dot de sa femme au profit de Penfant mineure, M. de PorteB aoheta en 1818, sur 
le oonBeil de Laplaoe et de Napoleon lui-mfime, le oh&teau de Mailloo, qui fut d^truit en 1928 par une 
inoendie, avec tons les effete appartenant au grand Laplace. 

X Les Congnet sont aujourd’hui repr4sent4s par M. Congnet, rddacteur au Minist&re des Finances, 
dont la mdre dtait soeur de Mme GeorgeB Delaplace, de Leaupartie. 
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L’une de ces filles, Marie-Anne, 6pousa Pierre de Laplace. C'esfc la nafcre de 
notre savant. 

L’autre, Anne Sochon, fut marine, k Glanville*, le 30 mai 1752, k Francois 
Cordier, Commis pour le Roi au Grenier et Magasin k sel de Danestal, fils de 
Frangois-Jacques Le Cordier et de Marie-Anne Gondouinf. Ils eurent pour fils: 
Louis-Frangois Cordier, directeur de la Compagnie des Indes, puis regent de la 
banque de France, sous le premier Empire. II 6tait tr&s li6 avec son cousin germain 
Pierre-Simon de Laplace. II 6pousa k Paris, le 15 janvier 1788, Frangoise-Jeanne- 
l£lisabeth Duclos, fille de Jean-Baptiste Duclos, avocat au Parlement, et d’Anne- 
6lisabeth M6nard. 

Ils eurent deux filles: 1° Louise-Itllisabeth, n6e k Caen en 1788, marine en 1812 
k Louis-Frangois Marchand, chevalier de la L6gion d’honneur. 2° 6lise, marine k 
Jean-Baptiste-Michel, baron de Tr6taigne, dont la post6rit6 existe encore. 


§ III. Le milieu familial. 

Nous pouvons, gr&ce aux donn^es prtSc&lentes, avoir quelque idee du milieu oil 
grandit Laplace. Ce n’est pas un milieu vulgaire. Les ancetres ont 6t6 des gens 
distinguAs. Sans doute le pere ne semble pas avoir poursuivi d etudes comme ses 
deux fr&res, Louis et Thoraas-Frangois, le pretre et le ehirurgien, et comme le 
granduncle, Maltre Olivier, ehirurgien royal, mais 6videmment il a acquis au 
foyer une certaine distinction, et e’est pourquoi sea compatriotes de Beaumont le 
choisiront pour sindic. 

Au nombre de ses plus proches, Pierre-Simon rencontre, des son tout jeune-age, 
labbe Louis de Laplace que les biographes nous montrent comme un mathematicien 
distingu6, et son autre oncle Robert Carrey le nnSdecin, qui appartient k une famillo 
c61ebre. 

Maitre Nicolas Le Carpentier, le cousin germain du pfere, cst Conseiller du Roi; 
c est done un personnage de marque, et ses deux fils, tous les deux Conseillers du 
Roi, Tun Maitre des Eaux et Forets, l’autre lieutenant en Election et seigneur 
de Put6t, sont Avidemment des gens instruits, des hommes du mondo, dont la 
fr6quentation ne pouvait que contribuer k former Tesprit du petit Laplace. 

Du cflt6 maternel, il y a roncle Cordier, qui mourra en 1757 et sera inhum6 
dans la nef de TAglise de Danestal. Lui aussi est un homme instruit et distinguA 

Le futur savant a done grandi dans un milieu bien capable de former son esprit 
ct de lui donner le goftt de la culture. Il faut done absolument rejeter la fable du 
petit indigent, fils d’un pauvre laboureur, 61ev6 grace k la charitA des moines de 
Beaumont. 

* Canton de Pont-r6v6que. 

f La Q6ntalogic des Le Cordier, seigneurs de Maloisel, a M Norite par M. Henry he Court. Les 
Cordier et lee Le Cordier 6taient de m&me famille. 
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§ IV. Le college de Beaumont. 

La maison dc M. de Laplace* * * § , le pere, s’616vait prfes du prieur£ de Beaumont,oil 
les inoines Benedictins avaient fondo un college. L'enfant leur sera bientdt confix. 
II nest done pas indifferent dc faire connaissance avec ce nouveau milieu. 

La premiere idee d’un college k Beaumont date du d6but du XVIII 6 si6cle, 
alors quo le pri<;ur6 etait encore en commende sous Denis-Fran^ois Bouthilier de 
Chavigny. L’un des moines, Dom Julien Aubr^e, resolut de s’occuper de l’6ducation 
de ()uel(pies enfants, Beaumont se trouvant eloign^ de tout college. Voici ce que 
dit de lui le savant Dom Martene: “Dieu lui avait donn6 un talent particulier 
pour bien enseigner les humanites aux enfants. litant au monastere de Beaumont- 
en-Auge, les peres et meres dalentour lui envoyaient leurs enfants pour apprendre 
de lui le latin. II avait un grand soin de les former en meme temps k la pi6t& 
A lui seul, il enseignait toutes les classes et il forma de trbs bons 6coliersf.” 

La premiere 6bauche prit forme grace a la protection du due d’Orleans, heritier 
des fondateurs du prieure. Celui-ci obtint du Roi que le monastere serait remis en 
regie, c.-a-d. n’aurait plus de prieur commendataire, mais un prieur r^gulier et que 
la “ mense prieurale,” ou portion des revenue reserve'e au prieur, serait r6unie k 
la mense conventuelle, le prieur-moine n’ayant pas de traitetnent particulier. La 
suppression de la mense prieurale eut lieu en 1731. Le college fut definitivement 
drig6 en 1741J. Il devait etre dirig^S par les douze moines du prieurA Le prieur 
avait surtout la direction spirituelle. Il y avait un regent s’occupant des Etudes. 

Les pensionnaires devaient appartenir uniquemenb aux paroisses relevant du 
domaine du due dOrl&ms. Il y avait des internes, payant pension, plus six jeunes 
gentilshommes, dont la pension 6fcait payee par le prince. 11 y avait bgalement des 
externes, dont la pension etait gratuite, d apres les statuts memos r6dig6s par la 
volonte du due d’Orleans. 

Les enfants pouvaient etre admis k Inge dc sept ans. On lie les prenait pas 
apres douze ans. Les hurnanit6s proprement dites commem;aient en cinquibme. 

La maison 6tait bien r6put6e, et Dumoulin, <^ui en 1764 publiait sa Qiographie 
dela France , y 6crivait: “Les B6n6dictins ont un beau College k Beaumont§. ,> 

Le petit Laplace habitait, corame nous Tavons vu, tout pres du prieurA On 
nous dit, ce qui est extrdmement vraisemblable, qu’il avait “une intelligence 
pr^coce,” “des dispositions peu communes k un Age oil les enfants commencent 

* Note par M. le Comte A. de Colberfc-Laplace: La famille Laplaoe avait habits la terre da M4rteier, 
que je poesMe encore, depute quand y 6taient-ils? je l’ignore,—la maison a des chemtetees du xiv ou 
xv si6ele. ‘Dans les papiers qui je poss4dais, et qui sonfc brAtes, je me rappeUe avoir vu qu’ils avaient fieflte 
k N...cette terre, que Laplaoe a rachet^ plus tard (179-), par rachat des rentes que la famille en tirait. 
Mais je me souviens qu’on m’a dit que Laplaoe 4tait n 6 au M4rteier. Cette terre est du reste sur la 
Commune de Beaumont—mais k 2 ou 3 kilometres du bourg. La maison ok est apposde une plaque en 
face del’^glise, est une maison neuve ou relativement. Je ne pense pas que si les Laplaoe ont "habits 
oette maisou, ce soit la m&me. 

t Vie desjustes, 4dit4e par Dom Heurtebize. Paris, 1926, t. m. p. 84. 

X Les pieces oonoernant oette affaire se trouvent dans le Gallia Christiana , t. xi, Instrumental 

§ T. u. p. 177. 
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k peine k aborder les premiers ElEments de la lecture ” et surtout“ une prodigieuse 
mEmoire*.” A la maison paternelle, ces heureuses dispositions pouvaient fitre 
entretenues surtout par Toncle Louis, autrement dit: 1’AbbE de Laplace. Celui-ci 
habitait Beaumont, peut-Etre la maison paternelle, peut-Etre le prieurE. II n'Etait 
pas moine, mais les religieux, trop peu nombreux, faisaient appel aux professeurs 
et aux rEpEtiteurs ecclEsiastiques ou laiques. Co qui rend vraisemblable la sup¬ 
position que 1’abbE Louis de Laplace faisait la classe au prieurE, c’est qu'il n’apparait 
jamais remplissaat une fonction ecclEsiastique. On le trouve a Beaumont comme 
sous-diacre en 1745, comme diacre en 1746. II fut vraisemblablement ordonnE 
pretre vers 1747. En 1752, il fut nommE chapelain de Criqueville. La chapelle 
de Criqueville situee " au costE gauche du choeur de l’eglise paroissiale ” Etait un 
“bEnEfice simple, 1 ” c.-&-d. sans charge d’&mes et n’obligeant pas k residence f. 

C’est de cet oncle que parle Boisard, lorsqu’il Ecrit: “Le jeune Laplace...re§ut 
les 1690 ns d’un de ses oncles, prEtre et mathEmaticien fort instruitj.” II est 
probable en effet que 1’abbE Louis de Laplace inculqua son goftt pour les sciences 
k son jeune neveu, mais il ne put le pousser trbs loin, car il mourut en 1759§. 
L’enfunt n’avait alors que dix ans. 

Il est probable que le jeune Laplace commensa ses Etudes rEgulieres au college 
k lage de sept ans, qu’il atteignait il la fin de mars 1756. Il y sera entrE, a la 
rentrEe d’octobre 1756. Le prieur k cette Epoque Etait Dom Joachim Hubert de 
Bailleul. O’Etait un homme instruit, qui auparavant, notamment d’apres des docu¬ 
ments de 1741 et de 1746, avait EtE regent des Etudes. 

Les ElEves se destinaient les uns k 1’armEe, les autres k la robe; d’autres enfin 
k l’Etat ecclEsiastique. Les premiers portaient un uniforme railitaire, les seconds 
un vEtjement bleu-de-roi, k revers et parements d’Ecarlate et Epaulettes d’or. Ils se 
coiffaient d’un chapeau k plumet blanc. Les pensionnaires ecclEsiastiques portaient 
un habit noir, conform© k leur future profession. Les deux derniEres categories 
formaient le contingent le plus nombreux. “ Nous voyons avec satisfaction, de- 
clarent les BEnEdictins de Beaumont, nos ElEves remplir les cures voisines et 
occuper les charges de judicature dans les environs||.” 

La famille du petit Laplace le destinait k 1’Etat ecclEsiastique U*. Peut-Etre 
y avait-il pris gotlt lui-mEme au contact de son oncle Louis. L’enfant portait done 
l’habit noir, et sans nui doute Etait exteme. 

La classe du matin commen 9 ait k 7 h. f et finissait k dix heures moins un quart. 
On se rendait alors k l’Eglise pour la messe; on se rEunissait ensuite k la sail© 
d'Etudes jusqu’k 11 h. heure du diner. 

L’aprEs-midi, la classe recommen^ait k 2 h. et durait jusqu’k 4h. Il y avait 
alors collation et rEcrEation, puis on travaillait k la salle d’Etudes de 4 h. f k 6 h. 

# L. Poiaeux: Notice* *ur Malherbe , Laplace etc . Often, Laporte, 1847, p. 84. 

f Archive* du Chapitr* de Bayeux. Insinuation* du dioclse de Litieux , Begiatre xxiii, No. 818. 

$ Notice* biographique*, UtUraire* et critique* sur le* homme* du Calvados . Often, Pagny, 1848, 
p. 177. Voir anaai H. Le Oonrt: G6n6alogie de la famille Le Cordier t p. 46. 

f Insinuations du dioc. de LUieux , Beg. xxvn, No. 212. 

|| Archtoe* du Catoado*: College de Beaumont. IT Boisard, op. ciu p. 177. 
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Lc dimanche, on assistait d’abord k une messe basse, puis il y avait 6tude pour 
les devoirs de classe. Durant cette 6tude, les Aleves se rendaient par groupes pr&s 
des “gardens de chambre” pour se faire friser, poudrer et ajuster. Ensuite avait 
lieu la grand messe, suivie d’une instruction sur l’^pitre ou n£vangile du jour. 
L’apres-midi, on assistait aux Vepres, apres quoi, il y avait recreation, puis 6tud& 
de 5 h. \ jusqu’au soir. 

Les vacances duraient six semaines et commen 9 aient vers le 8 ou le 10 aofit. 
La veille avait lieu la distribution des prix, qui 6tait toujours trfes solennelle. 
Quelques jours auparavant, on avait organise des Exercices publics en presence 
des notabilit&s locales et des parents, ou Ton posait des “ questions d’alg&bre avec 
equation au l er et au 2 C degr6” et oh Ton faisait des demonstrations sur “la Cosmo- 
graphie, la Fortification, la Trigonometric plane, la Balistique et les diff6rentes 
propri6t6s de Tellipse et de la parabole.” Des questions etaient hgalement pos6es 
sur les Belles Lettres, les auteurs latins etc., car au college de Beaumont l’&ude 
des langues et du latin 6tait fort en honneur. 

Le jeune Laplace avait des aptitudes sp6ciales pour les math^matiques, mais 
sa culture litt6raire allait de pair. Il avait aussi le gofit des beaux arts et en 
particular de la musique que Ton enseignait egalcment au college. 

Il y avait alors de 50 k 60 61£ves. 

Le priorat de Dom Joachim de Bailleul cessa en 1756. L’ann6e suivante, le 
prieur 6tait Dom Jean-Pierre Le Maistre. Un acte de 1760 nous mentionne comme 
religieux presents au monast&re: Dom Francis Theres, Dom Rene du Mesnil, 
Dom Jean M^iel-Bussy, Dom Jean-Charles Foyard, Dom Louis-Charles Gadeau, 
Dom Louis-Salomon Girouard, Dom Mathieu Crucifix. Ce dernier n’6tait que 
diacre, et il sera plus tard professeur k Tiron *. 

Aucun de ces religieux n’a laisse de nom ni dans les lettres ni dans les sciences. 
Ce devaient etre simplement de bons professeurs, le Superieur G6n6ral de la Con¬ 
gregation de Saint-Maur se'tant engage k envoyer des professeurs “capables... 
sages et vertueux ” dont “il r6pondait personnellement.” 

Laplace quitta le college & l’&ge de seize ans+, done aux vacances de 1765. Il 
eut par consequent pour professeur et “r6gent d , humanit6s,” Dom Charles-Antoine 
Blanchard, qui avait 6td envoy4 k Beaumont en 1764J. Ne h R6thel (Ardennes) 
le 20 janvier 1737, Dom Blanchard avait 6tudi6 les humanity k Caen, au college 
du Bois, durant sept ann6es. Il 6tait entr6 ensuite k labbaye de Jumieges, oil il 
avait fait sa profession monastique en 1757. En 1759 nous le trouvons 6tudiant 
la philosophic et la th6ologie a Pabbaye de Sainfc-Etienne de Caen. Il fut ordonn6 
pretre en septembre 1764, et aussi tdt envoy6 k Beaumont, vraisemblablement pour 
la rentr6e doctobre. Depuis lors “on Pemploya presque toujours k Pinstruction 

* Extrait du Nterologe de Vabbaye du Bee , 6d. par R. N. Sauvage (Extr. Bulletin philologique et 
historique, 1924). Faria, 1926, p. 8. 

t Le Fort: Le college et VScole militaire de Beaumont t dans la Revue illustrfc du Calvados^ mai, 
1913, p. 76. 

t Abb4 Porta: Lettres de quelques binedictins . Bruges, 1902, p. 5 (Extr. de la Revue b£n6dictine f 
1902). 
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de la jeunesse.” C’^tait “ un emploi qui avait toujours eu pour lui des attraits*.” 
Dom Blanchard n’a jamais rien public, mais il a laiss6 des Memoires historiques sur 
Vabbaye de Saint-Etienne de Caen, 6dit6es r6cemment par M. R. N. Sauvage, au 
tome xxx du Bulletin de la Society des Antiquaires de Normandie (1915). Le 
nouveau professeur avait 27 ans. II avait du talent et la passion de Tenseigne- 
ment. II put done avoir une certaine influence sur le gofit et la formation litt6raire 
de Laplace f. 

Dans les pages qui pr&s&dent nous n'avons rien dit de T^Jcole Militaire de 
Beaumont dont tous les biographes veulent que Laplace ait 6t6 T61feve. C'est que 
celle-ci ne fut fondle qu’en 1776, dix ans apr&s le depart du jeune 6tudiant. II 
serait peut-6tre bon de mettre fin ii cette erreur tant de fois r6p£t6e. 


§ V. Laplace A Caen. 

Age de seize ans, le jeune homme fut envoy6 k Caen, au College des Arts de 
TUniversit6, afin de poursuivre ses Etudes, toujours a titre d’6tudiant eccl6siastique. 
L&, il devait se perfectionner dans les “ humanites ” et 6tudier particulierement la 
philosophic. 

Le College des Arts se trouvait “ a Tangle do la rue des Grandes-I^coles et de 
la Cour des Cordeliers^, tout proche de Templacement de TUniversit6 actuelle. 
Son nom lui venait de ce qu’il avait 6t6 fonde par la Faculty des Arts. On en 
devait suivre deux ans les cours avant de passer en Th6ologie. Les s6minaires 
dalors n’etaient pas des maisons d’^tudes, mais soceupaient uniquement de la 
preparation spirituelle au sacerdoce. Les 6tudiants eccl&siastiques suivaient les 
cours de TUniversit^, lorsqu’ils en avaient le moyen. Les autres pouvaient dtudier 
prfes du pretre de leur paroisse, quitte a passer ensuite des exatnens. La fortune 
paternelle permettait k Laplace de prendre pension k Caen et d’y suivre les cours. 

Nous sommes en 1766. Le recteur de TUniversit^ est alors Jean-Jacques- 
Fran 9 ois Godard, prStre, licenci^-es-droits, professeur royal d’eloquence et proviseur 
du College du Mont. C’est un lettr6, auteur de quelques poesies et de tragedies 
d’un caract&re scolaire, destinees k etre jouees par les sieves, lors des stances 
litt^raires. En fin de mars 1767, il sera remplacd par M. LevSque, dont on ne 
sait k peu prfes rien et qui ne fit que passer, car en avril de la m§me ann6e le 
rectorat 6tait aux mains de M. Jacques Lentaigne, docteur en th^ologie, cur6 de 
Saint-Sauveur, th6ologien fougueux, qui resigna sa fonction le 29 septembre et fut 
remplac6 par Jean-Baptiste-Alexandre Hardouin, licenci6-&s-droits, proviseur du 

* Porde: VAbbaye du Bee et ses Gcoles. itvreux, 1892, p. 100, et Sauvage: VAbbaye de S . Etienne 
de Caen sous la rbgle de 8. Maur , p. ix. 

f Je n’&i pae trouv4 de programme d’^tudes so rapportant aux ann4ea passes par Laplaoe & Beau¬ 
mont, mais on a public dans le Bulletin de la Society d'histoire de Normandie, t. vii, le programme des 
Exeroioes dee aundes 1770—1773 tp. 862 bb.). Ils doivent 6tre senaiblemeut les mdrnes qu*au temps de 
Laplaoe. 

X F. Vaultier: Histoire de la ville de Caen. Man cel, 1843, p. 162, et C. Pouthaa; Les colUyes de Caen 
au xmi 9 sticle, p. 29. 
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College des Arts* Laplace fut done particuli&rement en rapport avec ce dernier, 
puisqu’il etait charg£ du college dont il suivait les cours. Le professeur de philo¬ 
sophic etait M. Christophe Gadbled, dont nous parlerons plus loin. 

Laplace, en dehors de l’University, fut certainement en rapport avec M. Len- 
taigne, celui-ci, comma nous l’avons vu, 6tant cur6 de Saint-Sauveur. Cette 6glise 
sYlevait tout pres des batimerits du College des Arts. C’est unbel Edifice gothique 
malheureusernent d6par^ par un porche du xviii® siecle, ot plus rnalheureusement 
encore transform^ en halles aux grains. 

NYanmoins d’apr&s une anecdote rapport6e par Puiseuxf, la paroisse du petit 
abbe aurait Notre-Dame-de-Froide-Rue, situ^e k proximity de TUniversit^, 
quoique un peu plus loin que Saint-SauveurJ. Voici cette anecdote: “C’6tait 
sous l’Empire, alors que Laplace 6tait revStu de la haute dignit6 de chancelier 
du S6nat. II parcourait la ville avec notre venerable M. Lair: passant devant 
Teglise Notre-Dame (aujourd’hui Saint-Sauveur), il lui prit fantaisie d'y entrer. 
Le cur6 faisait en ce moment une instruction sur le cat6chisme ii de petite enfants, 
et Tun d’eux, rebelle k la le^on sans doute, avait 6t6 mis en penitence. Laplace, 
qui nYtait pas connu, s’avanya vers Teccl^siastique: Monsieur le cur6, lui dit-il, 
j ai port6 autrefois le surplis dans votre 6glise; au nom de ce souvenir qui m’est 
cher, jc vous demande la gr&ce de ce petit garden. La gr&ce fut accord6e.” Laplace 
avait done pris le petit collet et s’assoeiait au clerg£ paroissial. 

Comment fut-il amen6 k renoncer aux Etudes th^ologiques pour se consacrer 
aux sciences? Voici ce que nous rapporte encore M. Puiseux§: “Un jour des livres 
de hautes math6matiques tombent entre ses mains; il se jette avec ardeur, avec 
passion sur cet aliment nouveau pour lui et vers lequel pourtant Tentraine une 
impt5rieuse sympathie. De ce jour, sa vocation est d^cidee: il s’abandonne sans 
reserve k Timpulsion de son g6nie ; Achille a trouv6 ses armes.” Le jeune homme 
fut-il influence et guide dans le milieu meme de TUniversit6? 6videmment oui, 
et 1 on cite deux hommes qui paraissent n avoir pas 6t^ Strangers, bienau contraire, 
& sa vocation scientifique. Nous en avons nomm6 un, M. Qadbled; l’autre est 
M. Le Canu. 

Christophe Qadbled 6tait n6 k Saint-Martin-le-Bouillant, pr&s de Villedieu 
(Manche), le 29 novembre 17321|. Il 6tait & la fois philosophe et math^maticien. 
Dans des Conclusiones philosophicce , de 176$, oil il figure comme arbitre, il est 
qualifi£: “ pretre, bachelier en la Faculty de Th6ologie de Paris, Professeur de 
Philosophie au College des Arts de la tr&s c<$lfebre University de Caen, et membre 
de la royale Acad6mie des LettresIL” Dans la France littiraire de 1769 **, il est 

* Eug. Ch&tel: Liste des Recteurs de V University de Caen . Caen, Leblano-Hardel, 1882, p. 47. 

+ Op. cit, p. S6 et p. 62. 

+ Cette mdme paroisse a pris, depuis la Revolution, le nom de Saint*Sauveur, 1’anoien Saint-Sauveur 
ay ant 6t6 supprimd. 

g Op. cit . p. 86. 

|| Le Manuel de bibliographie de FrAre porte la date de 1734. Nous devons la rectification A M. R. N. 
Sauvage, arehiviste du Calvados. 

11 Placard imprimei, Arch. Calv. SArie D, University C onclusion s 
** T. i. p. 74. 
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cit4 parmi les membres de l’Acadimie de Caen, avec cette mention: “M. Qadbled, 
professeur royal de Mathimatiques.” Enfin dans une Thbse de Mathdmatiques 
soutenue en 1772 par Louis-Marin Lancelin, il est indiqui comme president, avec 
cette mention Prisidera M. Christophe Qadbled, pritre...professeur de Philo¬ 
sophic en FUniversiti de Caen..., de Mathimatiques et d’Hydrographie*.” 

M. Qadbled publia en 1779 un ouvrage intituli: Expose de quelques-tmes dues 
v4ritSs rigoureusement demontrdes par les QeomHres et rejeUes par Vauteur du 
Compendium de Physique imprimi it Caen en 1776 f. L’auteur du Compendium 
de Physique se nommait M. Adam. Le titre indique qu *k cette ipoque ii y avait 
des polimiques entre giom&tres et physiciens. En cette m&ne annie 1779, 
M. Gadbled publia: Exerdce sur la thiorie de la Navigation , Caen, in 4°. II 
mourut k Caen le 11 octobre 1782. 

Pierre Le Canu avait d’abord midecin, mais comme son confrere M. Qadbled 
il 8 adonna, sans toutefois abandonner la midecine, aux mathimatiques et k la 
philosophie, et il enseigna ces trois sciences. 

Dans un acte public de 1773, il figure comme “professeur de Medecine et de 
Philosophie au College du Mont de l’Universiti de CaenJ.” 

La Biblioth&que de Caen conserve l’exposi d’un Exerdce sur le Calcul infinite¬ 
simal , imprimi k Caen, chez Poisson, 1788, in 8°, 19 p., se terminant par ces mots: 
“Ripondra M. Pierre-Jacques-Guillaurae Lair, de Caen. Prisidera M. Pierre 
Le Canu, Professeur Emirite et Royal honoraire de Medecine en l’Universiti 
de Caen....Lecteur du Roi et son Professeur de Mathimatiques au College Royal 
de Normandie§.” 

On a de Le Canu un: Compte-Rendu des maladies qui ont rigni pendant 
Vannie 1781 , sur les cdtes de la Normandie depuis la rivi&re de Dives jusqu'au Vey . 
Ce mimoire fut citi avec iloge par la Sociite Royale de Midecine, dans sa stance 
publique du 27 aoftt 1782. 

Laplace suivit les cours de ces deux professeurs, qui furent pour lui “plus que 
des maitres, des amis/’ et il fit sous leur direction “ des progr&s rapides dans le 
doraaine des sciences exactes||.” 

Il ne semble pas que le jeune savant ait pris le degri de Maitre-is-Arts. Il 
est certain qu’il n’entra pas dans la caricature et ne re$ut jamais la tonsure. 

M. Puiseuxfl nous dit qu’ayant quitti le College des Arts, Laplace fut un 
instant pricepteur dans une des branches de la famille d’Hiricy. Il s'agit pro- 
bablement de Philippe-Jacques, marquis d’Hiricy, brigadier des armies du Roi, 
qui avait alors des enfants d’une dizaine d’annies, et possidait un hdtel k Caen. 

‘ M. Puiseux nous dit encore ** qu’avant son dipart pour Paris, en 1768, alors 

* Biblioth&que municipal© de Oaen. Imprimd en 4°, 16 pp. 

f En 8°. Amsterdam (Oaen), 1779. 

% Archives du Calvados, D, University 862. 

§ Biblioth&que de Oaen. Le College royal de Normandie avait remplac4 en 1786 l’anden College 
des Arts. 

|| Puiseux, op. cit . p. 87. IT 0 cit p. 87. ** Loo. cif. * 
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qu’il n’avait environ que 18 ans, Laplace fut “r^p^titeur” au College de Beaumont. 
Ce ne fut Ividemmcnt qne pendant fort peu de temps. Peut-6tre durant quelques 
mois ou raeme seulement quelques semaines preta-t-il le concours de sa science k 
ses anciens maitres, trop peu nombreux pouf le nombre de leurs 61&ves. II faisait 
on memo temps ses prdparatifs de depart pour Paris. 

A Beaumont, depuis 1767, Dom Le Maitre n’6 fait plus prieur. II avait 6t6 
remplac6 par Dom Jean-Baptiste Marage. Les moines ^taient: Dom Alexis Pe- 
tillon, sous-prieur, Dom Marin Gouges, Dom Louis d’Hee, Dom Augustin Patattier, 
Dorn Jean-Pierre Bride, Dom Henri du Doit, Dorn Martin Vatard, Dom Robert 
Le Guelincl, Dom Charles Blanchard, Dom Pierre Chennebault. Peut-6tre, entre 
deux cours de math^matiques, le jeune horn me airna-t-il causer un peu literature 
avec son ancien maitre Dom Blanchard. 

Par ail leurs, les relations £taient rest£es tr&s suivies avec les deux professeurs 
de Caen, et M. Le Canu, en particulier, approuva chaleureusement le projet de 
voyage k Paris. II connaissait quelque peu d’Alembert, et il donna au jeune savant 
pour l’illustre acad&nicien une lettre de recommandation*. 

* Puiseux, op. cit . p. 38. 



STUDIES IN THE THEORY OF SAMPLING. 


By JOSEPH PEPPER, B.A., B.Sc. 

1 . A great deal of work has been done, both in theory and experiment, on 
various properties of small samples from a univariate population, with any law of 
distribution, e.g. the nature of the distribution of the means and variance in samples. 
With regard to bivariate sampled populations, certain properties of small samples 
have been found by Professor Pearson, Dr Fisher and recently by Dr Wishart*, 
but only in the special case when the sampled population is normal. Some of the 
values obtained by them appear in corollaries to formulae of my paper. 

In this paper, I have investigated theoretically the problem of sampling from 
any bivariate population, not necessarily normal or infinite. The method employed 
is purely algebraical and is an extension to two variates of the methods used by 
‘‘Student,” Dr Church and Dr Neymanf. Although the algebra is often heavy 
and complicated, yet the method has the advantage of yielding the result in a 
general form, from which the special cases of univariate, normal or infinite sampled 
populations may be deduced. 

The results obtained may be summarised under the following headings. The 
notation, which is mostly familiar, is defined in the next section. 

(i) The mean values of p u , pia, P 21 , £> 22 , p«i> Pn for the general case of any 
limited sampled population. A special case of these results gives the means of the 
second, third and fourth moment coefficients in samples from any limited population. 

(ii) The standard deviations of pn, pn, p$i in samples from any limited 
population and the corresponding deductions for the second and third moments. 

(iii) The standard deviations of p S i ) pn and ^22 in samples from any infinite 
population, giving the standard deviations of the third and fourth moments. 

(iv) The third and fourth moment coefficients of the distribution of p n in 
samples from any infinite population. 

(v) The fii and /St of the distribution of p n in samples from an infinite normal 
population. 

(vi) Correlations between any two of m g , m v , o-* 2 , <r y 2 and pn in samples from 
any limited population. 

(vii) The various deductions of all the above results for the special cases of 
univariate, normal and infinite sampled populations. 

# Biometrika, Vol. xvn. p. 176; Yol. x. p. 607; Vol. xx A . p. 32. See references given in last paper 
to other writers on this subject. 

t Ibid ► Vol. vi. p. 8; Vol. xvn. pp. 79 and 472. 
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2. The sampled population is bivariate, with variates 

{Xu FiX {Xu F 2 ), ... (X N , 7 n \ 
where we take S (X 8 ) ■* 0, S ( Y 8 ) = 0. 

The standard deviations of the variates are <r x > a Y and the correlation R. 
The product moments, I J ab> °f the sampled population are given by 

Pab = j{S(X, a Y, b ), 

where a, b are positive integers. 

We consider samples of n from this population, where any particular sample 
has variates (x lt y x ), (x 2i 3 / 2 ), ••• (#n> 2/n) and the product moment coefficients in the 
sample are defined by 

p<# = - 8 (*. - *)“ (y. - y) 3 . 

where « = ^ S (*,), y = ^S(y,), 

and o, £ are positive integers. In the later work I have replaced x, y by the 
notation m xt m y respectively. The standard deviations of the variates in the sample 
are denoted as usual by a Xi cr v . 

I have deduced many results for the case of sampling from a univariate popula¬ 
tion by putting X = Y and x = y in the bivariate results. I have then put 

= with 1*1 = 0. 

The standard deviation is In the sample, p a/3 reduces to the moment co¬ 
efficient and I have put 

m q = ^8(x a -x)i. 

The value of m 2 is the variance in the sample, which is usually written s*. 

I have denoted the mean of an expression by placing a bar over it. It must be 
observed that in the notation for the various sums I have indicated different values 
of the variates by giving them different suffixes, e.g. 8{x*x t y u ) denotes the sura of 
terms like x^x 2 y 2i so that s> t, u are combinations of 3 different numbers taken 
from the numbers 1 to n, and there will be frn(n — l)(n — 2) such combinations. 
Similarly S{x?y 9 yi) denotes the sum of terms like {oD£yiy% + xfy%yi)> and here 
there will be £n(n — 1) terms, and so on, for other sums. 

3. (a) The Mean and ( b) the Standard Deviation of pu in Samples from a 
Finite Population with any given Distribution. 

(a) By definition, 

Pu--S(x,-x)(y,-y) 

= S O.y.) - ^ S ( w , y t ) 


(1). 
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Now we have in repeated samples 

MeanS(*,y.)=£,S(X,F.), 

Mean 8 (x,y t ) « 8 (X.Y t ), 

and in the sampled population 

S ( X.7.) = NPu, S (X,Y t ) - - NP n , 

so that trom equation (1), 

—\ 1 f »(* —1) D 1 

Pu n » ( nP u) n ,j If _ 1 Pnj- 

(l--) 

V nJ n 


(>-i) ... 

When we put X =» T and x * y in (A), we obtain the familiar result 

M) 


•(A )' 


?. 


Vs)"" 

(6) From equation (1) 

Pn *■ O . y .)) 2 - 2(W w r - 1) S ( x , y t ) { S (*, y <)) 1 
-~ 3 rSW)+Js(*,V) + 2g >ff*« y ‘> {(«-!)«+ 1 } 


- - n 4- 1} S - ( ^^){2-2(n^l)} 

+ Ji<sr (x.x t y*) +^S(tc*y t y a ) + ^-SK®,y„y») .(2). 

In finding the means of the above sums, the following relations will be required: 

S(X*Y?) = N*P k P m -NPu, ' 

2S(X,F,X j F t ) - N»P u l - NPa, 

SWF,F t ) - 0 (X.X.F,*) = - NPn, 

S(X,Y,X t Y u ) = 2NP„- N»P n « 

2fif(X,*F < F„) = 2S(X,X«F.*) - 2NP« - N*P«P«, 

4S(X,X t Y u Y 9 ) - X J P»P M + 2N*P U *- 6NP«. 

* This result was first shown to me by Professor Pearson, and I reproduce it here for the sake of 
completeness. 
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Taking the mean for all samples from equation (2) 




P*= V 


(nP 22 ) + | i (re-l sl +l)| 


n (n - l)(NPu* - Pm) 
2(N- 1) 




+- J '”> + 1 <’ 1 < 2P » - 

4 «(>! — l)(w — 2) . 0 p »r p p \ j 4 (w — 1) n (n — 1) p 
+ n* 2(tf~T)lN=2 ) { - 22 m > + ““rf - i * 


4 n (n— 1) (» — 2) (n — 3) 


(NP M P „2 + 2NP n 2 - 6Pm). 


w 4 4(A r -l)(iV-2) (N-Sy 

Now using the result for in equation (A) we find, for the standard deviation 
of p U) after reduction, 

^vn^P^-iPnf 

N(N-n)(n- 1) 


n*{N- l)(N-2) (A 


_^ \(N-n-l) PjdPos + (Nji -N-n-l)P ts 

.jifcWil).!,*.!) 


...(B). 


Corollary 1. For samples from an infinite population 


ft — 1 




[P«)Po 2 + (ft - 1) P 22 - (n - 2) P u 2 ]. 


If the sampled population is also normal, P 22 = (1 + 2R 2 ) <r x 2 cr Y 2 } and* 

(n —1> 


^u=( W m8 )(l+^)^W. 


Corollary 2. Putting X = F and oc = y, we obtain the result f 
N (N — ft) (ft —1) 




n 3 (N —l) 2 (N ~ 2) (N —3) 

x[(N7i-N-n-l)(N-l) f n-(N 2 n~3N 2 + GN-3n-3)ti 2 2 ]. 


4. (a) The Mean and (b) the Standard Deviation of pn and p^ in Samples from 
a Finite Population with any Distribution . 

(a) Proceeding as in the case of pn> we have: 

P 2 i= S(x,-xf (y, - y)/n 

= ^ [<S (®«*y.) - (ar,y 4 ) + (y.) - yS (a;,*) + 2^S (<r,) - nx*y], 

which gives on substituting S (x 9 ) * rix , $ (y # ) * ny, 

Pn - ^ [S (®,*p,) - 2®S (®,y,) - (®, 1 ) + 2n®*i/].(3). 

* See Dr Wiahart: “The Generalised Product Moment Distribution in Samples from a Normal 
Multivariate Population/* loc, cit. p. 44 (5). 

t This agrees with the result obtained by Dr Neyman in Biometrika , Vol. xvii. p. 477, equation (62). 
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As in the case of pu, we have to express p a in terms of sums such as 8(afy t ), 
8 (x,x t y u ) and so on. For this we have: 

(x,y,) = \s {x*y,) + \s (x,x t y t ), 

71 71 

* 

yS(x,*) (x,*y a ) + £ (x, 2 y t ), 

® 2 y “ ^ [S (x*y,) + S (x.%) + 2S (x,x t y t ) + 28 (x,x t y u )]. 

Putting these values in (3), 

'■”%[( I -s + ») SW! '- >+ (s-;) i ' W! " ) 

+ („2 -~) s (WtVt) + ~S(x,x t y u ) J ...(4). 

To evaluate now the mean of p& , we have: 

Mean S(x,hy,) = n Mean (X, 8 F.) = ~S (X, 2 F S ) = nP n , 


Mean 8(x*y t ) = ” ( - g - Mean (X, 2 F ( ) = 8(X*V t ) 

“ N-1 

Mean S(x,x t y t ) - + ^^ ) S(XJ t Y t ) = - r ±£^P n , 

h„„ b < w .) - «<W„> - ■ 

Hence, taking the mean in repeated samples, we get, after reduction. 


p a = Pa 


(ijH 
K)K) 


(C). 


The value of pi* is obtained by replacing P& by P M in the above result. 
For samples from a normal bivariate distribution pa *= 0, since Pa ■» 0. 
For the single variate distribution we get from (C) 


- MHiJ 
"'“FUR)' 

where m* is the third moment about the mean in samples of n, and is the third 
moment in the sampled population. 
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When the sampled population is infinite, we have, making N-*- oo in (0), 

**, = />* (l-i) (l-*). 

(b) For the evaluation of o* Pn we require the value of pjj?, or since, from 
equation (4), 

= 4 [(n -1)(n - 2)(S(ay hj,) - (n -2 )S{x a y t ) -2(n-2)S (x,x t y t ) + 4 S(x,x t y u )], 

71 

we have to expand the squares and products of these sums into other sums and 
find the mean values of the latter sums. For this we have: 

[S (ay*y»)] 2 = S (xfy, 2 ) + 2 S (x*y a x?yt)> 

[5 (ay ! y ( )] a = 8 (x*y t a ) + 2 S (x.'y.xfy*) + 2 S {x, 2 x?y 2 ) + 2 8 (x*y t y u ) 

+ 2 8 (x 2 y,x t 2 y u ) + 4 S (x*x t hy u y v ), 

[<S (x K x t y t )] 2 = 8 (x 2 x, 2 y t 2 ) + 28 (x?y,x t *y t ) + 28 (x 2 y„x t y t x u ) 

+ 2S (x 2 x t y t x u y u ) + 28 (x 2 y 2 x t x u ) + 4 S (x,y,x t y t x u x v ), 

[/S (x,x t y u )] a = S {x a x t a y a ) + 28 (x*x t y t x u y u ) + 2<S (x a x t a y u y v ) 

+ 2 S (x*x t y t x u y v ) + 28 (x 2 y t a x u x v ) + 4,S (x,y,x t y t x u x v ) 

+ 4 8 (x,*x,x u y v y„)+ 6 8 (x,y,y t x u x v x„)+12S (x,x t x u x v y v y x ), 
8 (x*y,) S (x 2 y t ) = S (x*y,y t ) + 8 (x 2 y 2 x t 2 ) + 8 (x 2 y s x?y u ), 

S(x 2 y,) 8(x,x t y t ) = S(x a y s x t y t ) + S(x a y a x t ) + S(x a y,x t y,x u ), 

S (x 2 y,) S (x,x t y u ) = S (x a y,x t y u ) + S (x 2 y a x,x u ) + 8 (x a y.x t x u y v ), 

8 (x 2 y t ) S (x,x,y t ) = S (x a x t y t a ) + S (x a y,x t y t ) + 8 (x a y,x u y u ) + S (x a y,x t y u ) 

+ 2S (x a x t y t x u y u ) + S (x, 2 x t y t 2 x u ) + 8 (x 2 x,y,x u y v ), 

8 (ay l y t ) 8(x e x t y u ) - S (x a x t y t y u ) + 8 (ay l y.x,y t x u ) + 8(x a y t a x u ) 

+ S (x 2 x t y t x u y v ) + S (ay l y?x„x v ) + 28 (x a x t y u y v ) 

+ 8 (x a y.x t x u y v ) + 2S {x a y t x u x v y K ), 

8 (x,x,y t ) 8 (x,x t y u ) = 8 (x 2 y,x t 2 y u ) + S (x a y,x t y t x u ) + 8 {x 2 x t y t 2 x u ) 

+ 8 (x a y,x t x u y v ) + 8 (x,y 2 x t x u x v ) + S (x 2 x,y,x u y t ) 

+ S(x,y,x l y t x u x v ) + 3S(x,y,y t x u x v x„). 

In the above expressions there are 28 different sums involved, whose mean values 
are to be found. As before, we can express the corresponding sums in X and Y 
in terms of the product moments, Fob, of the sampled population. For the sake 
of brevity, I have not attempted to show the forking but merely state the.results, 
as follows: 

2S(X a Y,X t a Y t ) m N»P n * - NP a , 

8 (XfYf) «* N a P,u>Pn — N. P«,, 

S(X a Xm - N'P»Pn - NPu, 

28 (X'X? F„») - »P» 2 Po» - W* (P 40 Pos + 2P*P») + 2NP a , 

2S(X*Y t Yj = -N i P t0 P n + 2NP it , 
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8 (X/F, F«)- 8 (X/F, 2 ) - - NP a = S(X*Y.'X t ), 

S(X*Y,X t *Y u ) --N* (P n 2 + P»P«) + 2NP4a, 

4S (X, 2 X t 2 Y u F«) - - X 2 P» 2 P M + X® (4P»P« + P« 0 P M + 2P„ 2 ) - 6A’P«, 
S(X, 2 F,X t F t ) = N'PaPu- NPu, 


8(X*Y,X t Y t X u ) = - X* (P a 2 + PaaPu) + 2XP*, 

2-Sf (X, 2 X ( F(X u F tt ) = - N* (2 P M P U + P„P a ) + 2XP*, 

2S (X, 2 F. 2 X 4 X U ) = - N'PnP* + 2 XP«, 

4<S (X.F.X, F ( X U X # ) — X*Pu 2 P» + 2V 2 (2Pji* + 4PsiPu + P»P*a) - 6XP«, 
<S(X, 8 X t F ( 2 ) = X*PsoP»-XP«, 

£ (X, 8 X t F t F u ) = - X* (P 30 P 12 + P 31 Pu) + 2 AP« 2 , 

/S(X, 8 F,X t F„) = - X’PaiPu + 2XP«, 

/S (X,* Y,X t X u Y v ) = X 2 (P a 2 + P a P u + JP»P 8 ) - 3 NP U , 

8 (X*X t Y?X u ) = -X 2 (PaoPu + P^oPas) + 2XP*, 

S (X, 2 X ( F,X m F„) = - WP, + A* (Pa 2 + 3 PaPu + 2P S0 P ra + P^P^) 

-6XP42, 

S(X, 3 Y?X U ) -X 2 (P 40 P 02 + PaoPn) + 2XP**, 

2S(X. 2 F, 2 X„X„) = - WP„ + X 2 (3P»P M + 2P4oPo* + 2PaoP M ) 


-6XP„, 

2/S (X.^F.F,) = X 2 (2P 81 P 1 , + P 40 P 02 + 2PaoP M >- 6XP4*, 
6S(X,X,F* 2 X u X e ) = X* (2P3oPi 2 + 3P„P a ) - 6XP«*, 
2/S(X, 2 F < X u X 1 ,F w ) - N* (Pn 2 Pao + iP» a P<«) 

— X 2 (4PaiPu + 2P M 2 + 4 PjoP* 2 + P«Pog + 2PaoPij) 

+ 12XP4*, 

12/S (X.F,F t X tt X 0 X tt ) = 5iV 3 Pn 2 P» 

- X 2 (SPmPb + 16P m P u + 6P a *+^P»P«) + 22XP« 
— 24/S(X,X < X tt X„F w F*). 

We are now in a position to write down the mean value of p a a , and from the 
equation 

^pti ~ P*? ~ (Pa) 2 , 

where pa is given by (C), we can find 

After substitution and simplification, I obtained the result in the following 
form: 


Pti 


N(n — 1) (n — 2) (N — n) P«oPo* \, n 2 -12n + 28 8(n-8)(»-6/| 

n*(X- 1)(X- 2) L ( ” ’ " X-3 (X-3)(X-4)J 

. 4X(»-l)(n-2)(X-n)PaoPur^ on («-2)(»-10) 

+ n'{N-\)(N-2) I/ 1- *' N -3 


16 (n — 3) (» — 5) 8 (» — 3) (n — 4) ] 

3 (N -3) (A -4) 3(A-3)(X-4) (X-5)J 
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4A r (?t - l)(ra - 2) (N-n)P al P u 


w®(A)-l)(iV-2) 


(ft- 2) 2 + 


9 n 2 — 42ft + 56 


8 (« - 3) (3n - 10) 

* 7'xr o \ / at a \ i 


N-3 

32 (w — 3) (n — 4) 


(iV-3) (N — 4) 

2A r (ft -1) (n - 2) (AT- ft) Pa, P a 


(AT— 3) (AT—4) (AT 




n*(AT-l)(JV'-2) 


|(n-2)(n- 


3) + 


2 (5ft 2 — 27w + 42) 
AT—3 


2 (ra — 3) (13n — 68) 20(ra-3)(ra-4) 1 

+ (A r - 3) (Af — 4) + (JV-3)(iV-4)(Ar-5)_ 


N 2 (n-\)(n-2)(N-n) P^P,* 
~«® (J¥ -l)(JV-2){JV- 3) 

4AT* (w. -1) (« — 2) (AT- n) P U 2 P*> 


ra®(AT — \){N — 2)(N — $) 


5?[(ra 2 - 

t 


4ft + 12) + 


4 (ra — 3) (n — 6) 


AT— 4 


I 


2 (n 2 — 4n + 6) + 


Af(ra -1) (n - 2)(N-n) P* 2 
~n*(N — 1) (N 


8 (ra — 3) (5 m —18) 

■ * /"»r oV / vr a\ 4" 7 


^~[ ( «-2)(n-6) + 

48 (n — 3) (n — 4) 


(n — 3)(7n- 22) 
AT-4 
10 (w — 3) (n — 4)1 
+ (AT-4)(W-5) J 
15 (n —2) (« —4) 

AT-3 

. 2» (7« - 6) 


(AT - 3) (AT— 4) T (AT - 3) (AT - 4) (A7 - 5) T (W - 1) (AT - 2) (# - 3) 


(n — 1) (w — 2) (AT - ») P« 
n® (AT — 1) 


6 (13ft 2 — 72w+ 100) 

+ “ (S-2)(N-S) + 


2ft (3« - 2) I 

+ (Ar-l)(AT-2)J 
[(» - 1) (* - 2) + 6 - (W v~j ) _ (2 2 W ~ 5) 

8 (ft -3) (23ft -94) 


(AT-2) (AT-3) (AT-4) 
176 (ft-3) (ft-4) 


>] - 


(D). 


(AT— 2) (AT —3) (AT- 4) (AT— 5)J 
It is possible that the above expression may be simplified in some other way, 
but it is in the most convenient form when considering the case of N infinite. It 
will be seen that (N — n) is a factor throughout, as we should expect. 

Corollary 1. When N -+• oo 

-2 _ (ft- 1 )(n- 2) 2 [(n _ 1)i3ia + iPaoPia _ (n _ g) p a 2 + PwPm 


Ptl ' 


(ft -!)(»- 2) 


- 4 (ra - 2) PgiPu - 2 (n — 8) PsoPa] 
[(» 2 - 4ft +12) Px, 2 Pot + 8 (ra 2 - 4» + 6) P U 2 P»]. 


Corollary 2. When AT-*- oo and we take X # *= Y s 


(ft-1) (ft-2^ 


[(ft — 1) — 3 (2n — 5) fit fit — (ft — 10) /a* 2 ] 


3 (n — 1) (n — 2)(3n* — 12ft + 20) .. 

+ 72® *** ' 

where m 3 is the third moment coefficient about the mean in samples of ru 
* This result may be written more conveniently, in terms of the p% 

•v, = (n ~ 1 I!”~ 2) [(« -1) (» - 2) (fit - 16 ) - 8 (n - 2) ( 2 n - 6) (ft - 8) - (» - 2 ) (» - 10) ft+6»«] 
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Corollary 3. When N eo and both variates are normal 

)(i +2 

If in this we put R = 1 and ax — o’y, the case of a single normal variate, we 
have 

6(n-l)(n-2) 


m,- 


<TX> 


which will be the result obtainod when we put the normal values, /i«»15<r^ 6 , 
/*4 = 3<r x 4 , yas *= 0, in Corollary 2. 


Corollary 4. The value of a ^ when N is finite is obtained by putting 
X = Y in (D). As I was not able to simplify the result appreciably by collecting 
the terms in /i*, Ms 2 and /i* 8 , I have not written down the full expression 
for (7*™,. 


5. The Mean of p 3 i in Samples from a Finite Population with any given Dis¬ 
tribution . 


p a i = \s{x,-xf(y,-y) 

1!/ 

= i 5 ( x s 3 — 3xx 2 + 3a? x, — a?) {y, — y) 

= ^ [-S (x 2 y.) - 3xS(x 2 y,) + Sx?S (x.y,)-yS(x 3 ) + 3xyS(x 2 ) - 3nS s y].. .(5). 

Putting * = - 5 (a;,), y = - S (y,) and multiplying out the above products, we 
nn 

get 

xS (x 2 y,) = ^ [5 (a?, 8 y,) + 5 (« a 2 y,« t )], 

x*S(x,y.) = ^ [5 («, 3 y.) + 5 (x 2 x,y t ) + 25 (x 2 y,x t ) + 2 5 (x,x t x u y u )], 

/I 

y8(x 3 ) = l [5 (x 3 y.) + 5 (x* y,)], 

7v 

ay5 (a;,*) = ^ [5 (x s 3 y„) + s (x 2 x t y t ) + S (x 3 y t ) + 5 (x 2 y,x t ) + 8 (x, 2 x t y u )]. 

To evaluate x*y I first expanded oPy and then multiplied each of the resulting 
sums by x\ thus 

a?y = -3 [S(x, 2 y,) + 8 (x 2 y t ) + 28 (x,y,x t ) + 25 (x,x t y u )]. 
n 

We have evaluated xS {x?y>) above, and for the others 

xS ( x 2 y t ) = ^ [5 (x 2 y t ) + S (x 2 x,y t ) + 8 (x 2 x t y u )], 

xS (x,y,x t ) = ^[5 (x 2 x t y t ) + 5 (x, 2 y,x t ) + 2S (®,<r,ir«y B )], 

xS (x,x t y u ) = *[5 (x, 2 x t y u ) + 5 (x,x t x v y u ) + 35 (x,x t x u y v )]. 
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If wo now substitute the values of these products in equation (5) and collect 
terms belonging to the different sums, we obtain 

p 3 i = ^ [(n - 1) (n a - 3» + 3) S {x,*y,) - 3 (n s - 3re + 3) 8 (x*y,x t ) 

+ 3 (2n - $)S(x?x t y t ) + 6 (n - 3) S ( x,x t x u y u ) - (n* - 3» + 3) S(x,*y t ) 

+ 3 (n - 3) <S (x, 2 x t y u ) - 18/S ( x,x t x u y v )] .(6). 

For the evaluation of I obtained the relations 

Mean S (x, a y,) = ^ (IVPa), 

Mean S(x*y,x t ) = 

Mean S (x?x t y t ) (iV a P»P u - NP n ), 

Mean S(x.x,x u y u ) = (2 IVP ffl - N*P M P U ), 

Mean S (x*y t ) = * —“i) 

Mean S(*.'«,y u ) = ^ NPxi ~ N * PM > 

,, 0/ , n(n-l)(n-2)(n-3) (N*P»P u -2NP 3 i\ 

Moan «S y.) - - f) (N-2) (IV-3) ( 2 ) * 

Substituting these mean values in equation (6) and collecting the terms in 
Pai and PaoPn, we find 

^3i= flPsi + B^PaoPn .(E), 

whero 


(?* 2 - + 3) + 


4?i 2 -18rc+ 21 12(rc-2)(w-3) 

" JV_T + (IV-l)(IV-2) 

18 (n — 2) (n — 3) 
+ (IV — 1) (IV —2) (IV - 


t) 1 
— 3)J ’ 


0 IV(«-l)f2n-3 2(»—2)(n —3) 3(n-2)(»-8) 1 /lf , 

P “ n* U-l (IV — 1) (IV - 2) (IV — 1) (IV — 2) (IV — 3)J 

The value of pu is obtained by substituting P 13 and P 02 for P& and P*> in (E). 

Corollary 1. When N -*■ 00 

fta = ” 1 [(« a - 3» + 3) Pa + 3 ( 2 n -3) P»P U ]. 

Corollary 2 . For a normal population Psi = SRo-^ay, Pn = R<r X (r Y , so that 

pzi**3R<T X 9 cry(a -!-#) = Psi(a + #)> 

where from (7) 

2(fi a -3*i + 3) 5(n-2)(n-3) 

. 9 (« — 2)(w —3) 1 

CAT — 1)(IV—2)(IV-3)J ’ 
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Corollary 3. Putting the normal values of P a , Pu in Corollary 1 or making 
N-+ oo in Corollaiy 2, we have for samples from an infinite normal population 


3(»- 1)* . (n — l\ a D 

P*. • — t ~)P*i- 


Corollary 4 The mean value of the fourth moment coefficient in samples of n 
from any population is given by 

mi = afH + 3/3/x* 2 . 

For the cases of normal or infinite populations we can simply put R = 1 and 
a Y i n the first three corollaries; thus 

mi = [(w a — 3 ti + 3) fii + 3 (2 n — 3) p j 2 ] for any infinite population, 

3 (Tx*(& +ft) for limited normal population, 

m 4 = 0-^4 f or infinite normal population. 


6. The Mean of p& in Samples from a Finite Population with any given 
Distribution. 

Pu = \s{x,-xf{y, -yf 

lb 

- \[SWy *)~ 2^ M) - 2 yS{x?y.) + &S (y, 2 ) + y*5 (x?) 
n 

+ 4ixyS (x s y 8 ) — 3 wa?#*]. 

Expanding these products, we get 

(*.y.*) - ” [$ (*, a y.*)+£(*.y. a *«)]. 

2/S (a» a y«) = ~ [S 0. 2 y* 2 ) + S (xfy.yt)], 


**5(y,*) - — [S Or,*y,*) + 5 (x*y t 3 ) + 25 (x,y*x t ) + 2/S (<r,a; t y u 2 )], 

y*5(*.*) = ^ [5(* t a y*) + -S(«, 2 y< 2 ) + 25(a^y.y,) + 25 (x*y t y u )], 

xyS(x,y t ) =» ^ [ 8 (*/>V) + 25 (x,y,x t y t ) + 5 {x?y t y t ) + 5 (x t y?x t )+S (x,y a x t y u )], 

^ [5 (®, a y, a ) + s (*,V) + 2<S («, 2 y,y<) + 2S (x,y*x t ) + 25 (x*y t y u ) 

+ 25 (<r,a?,y„ 2 ) + 45 (x,y,x t y t ) + 45 (x,y,x t y u ) + 4S(«,a: < y 1 ,y„)]. 
These equations give, after substitution, 


p„=^[a(n-l)S (<r,V) + ( 2n -3) 5 (ar.*y* # ) - 2a (5 (*.*y.y ( ) + 5 (x,y?x t )) 

+ 4 (2n - 8) 5 (x,y,x t yi) + 4 (n - 3) S (x,y,x t y u ) 

+ 2 (n - 3) (5 («, 2 y,y u ) + 5 {x,x t y u ')} -125 (x,x t y v y.)] .(8), 

where « ■»«* - 8n + 8, a quantity which often appears in this work. 

Blometrika zzi 17 
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The above sums are the same as those which occurred in the evaluation of a 2 Pn 
Their means have been found. From these values we obtain, after simplification, 

= a + $ (P20 -P02 + 2 Pn 2 ) .(F), 

where a, ft are the same coefficients as those which occur in the value ofpai* 
Corollary 1. When N 00 

jta = n ^ 1 [(»i a - 3n + 3) P a + (2 n - 3) (P»P m + 2P u *)]. 


Corollary 2. For a normal population P 22 = (1 + 2 R 2 ) cr z V y a , and thus: 

P 22 = -P 22 ( a + ft), 

and for the case of N 00 



Since p<& and p 31 both reduce to the mean value of the fourth moment, m 4} in 
the case of a single variate, we shall obtain the same results from (F) as from (E) 
when we put X 8 ~ F a . 


7. The Correlation between the Variances in Samples from any Bivariate 
Distribution . 


For any sample, we have 




Similarly a 2 = ? -i—^ S (y 2 ) - 2 S (y a y t ). 

^ ft 71 

Therefore 

f [8 (<r,V) + s Wyt 2 )] + -1 I s ( x *y* x tyt) + 8(x,y,x t y u ) + 8(x,x t y u y v )] 

ft It 


- 2 ~«“ I s ( x ?y*yt) + 8(x,y, 2 x t ) + S{x,x t y 2 ) + 8(x 2 y t y u )] .(9). 

Thus we have once more the sums involved in the expression for pu 2 , equation 
(2). Using the mean values there obtained, we get, after reduction, 

A _(N-n)(n-l)P 22 \n-l 2(2a-3) 6(n-2) ] 

n » |_JV— 1 + (ZV — 1)(jZV— 2)(jV— 1)(iV - — 2)— 3)J 

2N(N —n)(n — l) Pu 2 1" 1 (n- 2) ] 

+ [(^-l)(iV-2) (N — 1)(N— 2) (N —3)J 


, NP m P m 
+ 




(n-l)(n—2)(n—3) 
(N—l)(N—2)(N—3) 


+2(n—1) 


( n-l)(w-2) ~ 
(N-l)(N-2)_ * 


o'* 


N(n-1) 

n(N-l) 


y(«-i) p 

n (N- 1) °*’ 


Now 
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so that 
and 


N» (» -iy 

°V ““ n 2 ] ^2 * 02i 


_(JV_„)(„_!) C f»-l 2 ( 
“. n® “ L **(iV-l + (iV- 


+ 22VP] 


, ,f_L 

u W-i)( 


(2 n - 3) 6 (« - 2) 

1 j (AT. _ 2) + (N- 1) (JV- 2)(JV- 3) 

ft — 2 


j 

{iV(W - 2) (n -1)-(» + !)}]. 


1)(N— 2) (iV — 1) (iV — 2) (N — 3). 

NPtoPoi 


(N-lf(N-2) (. N■ 

We shall find the correlation between <r* s and er„ 2 from the relation 


T- (o"* 2 ^ 2 “ .( 10 )- 




Thus we now require the standard deviations of the variances, viz. <r ffa .*, av y *. 
This is given by Corollary (2) to the result (B) and the standard deviations may 
be written in the form 

2 N( N-n) (w-1) JV . 

'* - a»(W-l) 2 (iV-2)(^-3) ( ’’ 

^ - a®'(iV-i)*(W-2)(2V-3)^ 2,ir '* 


where 

and 

Therefore 


<T**Oo* = 


^=(iV-l)(^ft-iV'-ft-l), 
i? = iV^-3^ 2 + 6^-3ft-3, 
/O — P» # _ ^04 

-«)(»- 1)P»P« 




'» «®(iV — l) 2 (A r — 2)(iV-3) 

Hence, using the above equation (10) for the correlation, this may be written 
in the form 

ilP 22 +2(iy~l)(Ar~ft-l)Pn 2 -P g ) P w { ff(Ar-2) (n - 1)- (ft-fl)} 




PaoPoa V(^l/3a, x~~ &) (^£?a> y — P) 
Corollary 1. When iV-^oo 


.(G). 


“ P»Po* V{(» - 1) - (» - 8)1 l(n - 1) A, y-(«-3)]' 

Corollary 2. For samples from an infinite normal population we must put 

Paa =*(1 + 2P 2 ) c^cTir 2 , ftiix — fttiY — 

and we obtain the simple result 

independent of the size of the sample, a result already familiar*. 

* Bee Wishart; loe , eit. p. 43. 


(n~l)(Paa-PaoPoa) + 2 P 11 2 
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8 . The Correlation between <rf and pn in Samples from a Finite Population 
with any given Distribution . 

We have the equations 


^ ( x ») ~~ ^ 


Pn= n n r s (*«y«) ~^ s 


. (re-1) 2 , . w 2 -2w + 3 0/ . . 3(re-l) 0/ „ x 

<t*Pu = 4 - £+ —- 4 — s W^tUt) - --— 4 — ^ 

Tv 71/ Tt 

- 5 u ) - - n ~— 5(« s 3 2/i)- ~ 4 3) s {x*x t y u ) 

TV 71 Tt 

0 

+ - i S(a;,a; t a;„y 1) ) .(11). 

Tv 

These sums are the same as occur in the expression for p 3 i, and their means 
have been found. Hence, using these values, and the following means, previously 
reached, 

-2_ N(n- 1) p — __ N (n-l) ^ 

* n(N- iy K ’ Pu ~n(N-l) Pn ' 


we obtain after simplification 


<TxPn - <r x 2 Pn ■ 


N(N—n) (re — 1) 


(APn-BPM, 


n 3 (N-lf(N-2) (N — 3) 
where A and B are the expressions in the previous section. 

Now to find the correlation between a/ and pn we need the following standard 
deviations: 

N(N — n)(n — l) 




n*(N-l?(N-2)(N-3) 
N(N-n)(n-l)P x * 

a “ 0 / %r 1 / \r 


[APn + (N-l)(N-n-l) P„P M 

- (re(iV-2 )(N + 1)-2(N- l) 2 }P u 2 ], 


- L ' V*’ "V V" ■»/ ■» a> {AO D\ 

n 3 (N—lf(N — 2)(N—S) D >' 

giving the result 

..._ APsg — BP;oPn _ 

'* ,Pl1 P»V^/8 Sli -PVlP M + (i\r-l)(Pr-n-1)P„P M -OPn 2 *■*' * 

where (7=«n(JV-2)(iV’+l)-2(iV-l) 2 . 

Corollary 1. When N-+-<x> 

r __ (w — 1)P M —(w-8)P»Pii _ 

'* ,Pu P»V(n-l)A,x-(n-3)V(n-l)P„+ P»P w -(n-2)P u > ’ 
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Corollary 2. For samples from an infinite normal population, where 
P a -3i2^V r> P» = (l + 2PVxV 2 , &,x = 8, 

FTbF 

r «**.Pn “ Y i + jp* 

once again independent of the size of the sample, a result already known*. 

9. The Correlation of (a) m» arid rriy, ( b) and m„, (c) and /> u , in Samples 

from a Finite Population with any given Distribution. 


(a) We have: 


m » m v * 1 


We also find: 


S(x.y.) + 8(x,y t ) 

- 1 \nP n ( n ~ 1 ) p 1 
= n*l nFn -N(N-T) Pn ] 

_ (N~n) 

n(N-l) u> 

m x = my** 0, 

, 3 _ ft j) o _ W ft 


H ““ r --. - T,irif"-/^T) ^-.» 

It will be observed that this result is true for any sampled population, not 
necessarily infinite or normal, and it is independent of n, the size of the sample. 

(i b ) We have: 

** = ( n -^r) 8(x*)-* a S ( x,x t ), my = ^S(y„), 

•r<My = [fif(«;, 2 y,)+£(*, s y*)] - [S+£ («,««?„)] .. .(12). 

If we refer back to Section (4) on pai, we shall find the mean values of the 
above sums written down. These give 

_N(n-l)(N-n)P n 

<r « mv ~~n*(N-l)(N-2) - 

Now o^wiy — 0, since my — 0, 

N(N-n)(n-l)P K K , JO m _ s _(N-n)P M 
and „8/»r_itt7» o\/■xr„o\(A8t>x 8), a ^ n(N—\) ’ 


n 8 (2V-l)»(tf-2)(P-3)' 


pvu.g ^^"(Fi rg - p -V '- # JW ^-iT -‘. WT * 

* See Wishart: loc . cit. p. 43. 

t It will be observed that both results (E) and (L), when we put X-Y, reduce to the result obtained 
by Dr Neyman for the correlation between the mean of a sample and its varianoe, given in equation (67)* 
Biometrika , Tol. xvn. p. 479. 


Pn / (n-l)iy(iyr-l)(jyr-2)(N-3) K . 
^moV Pm(Afa, x -B) . WT> 
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Corollary. When 




*' ** -^20 ^ P(i2 * (ft —1)$2, JT — — 3) 

Since for a normal population P 21 = 0, ry^s ?ny will be zero in this case. 

(c) We have: p u = 1 8 (x s y„) ~^- 2 S (x, y t ), m* = ^ S (x,\ 

™*Pu = S (x*y,)-±S(x*y t ) + n -=^ S(x,x t y t ) - 1 8 (*.*«*.) . ..(13). 

tv 7 L 71 71 

Again using the mean values of the sums found in ;> 2 i we get 

JV(n-l) (TV-n)P ffl 


7l 2 (TV — 1) (N - 2) 


the same result as for a x 2 m y . 


Hence, remembering that m x p n = 0 and using the values of a 2 mx and <r 2 Pll as 
before, we obtain the correlation 


r = _ ?*__ J' <* - 1 )N(N-1 )(N- 2) {N- 3) __ 

mx,Pu (A-2)Vp a) V i> 22 + (iV r -l)(W-n-l)P ao Po2-CP 11 * ^ ' ’ 

Corollary . When N->oo 


f m x ,p n • 


V /_ 


(n-1) 


VP» V (n - 1) P 22 + P 0 P 02 - (» - 2) Pu 2 ' 


10 . The Standard Deviation of (a) m x — m y , ( b ) m x m y . 

(«) «** - = “ [S (x s ) - ,V (y,)], 

(% - ™*) a = i [5 W) + as (.r 8 .r ( ) + (y/) + 2&(y.y,) - 2S(x,y.) - 2 S(x,y t )l 

- »[y ■ NP -- n orfij N1 ‘" + y - j$£0 

- 2n NP + 2n (w ~ 1 ) ATP 1 

A ^ Pu + i^(A-l)^ Pu J 


Since w e -w # = 0, we have 

N-n 

V^mx-my — _ 2) (°** + °*r 8 ~ ^Pcr x CT F ). 

For N~*oc this agrees with the familiar result 

= (~f) + (J-r) - 2 (Jf) (5f) R- 


See footnote f, p. 245. 
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(b) 


mjniy 2 


m* a - 3 [S (x, 2 ) + 28 (x.x t )], m 2 = ± [S (y 2 ) + 28 (y.y t )], 

^ T * 8 (x 2 y 2 ) + 8 (x 2 y?) + 28 (x 2 y,y t ) + 2 8 (x,y 3 x t ) + 28 (x 2 y,y u ) 


+ 28 (x,x t y 2 ) + 4 S (x,y,x t y t ) + 4 S (x t y a x t y u ) + 4 S (x,x t y u y,)]. . .(14). 

Using the values of the means of these sums, previously found in § 3, it will be 
found, after reduction, 

- 2 - 2 . . (tr-n)P n r 1 6 (n- l)(iV-n-l) ] 

* * “ n 3 IN-1~(N-1)(N-2)(N - 3)_ 

^ N (N - n) (n -1) (N - n - 1) (P* P <*+ 2 Pn 2 ) 


From previous work, we also have 


n 3 (N — 2) (N — 3) 


( N-n ) p 

»l.»« = '/if -Til. 


x "^~n(N- I)' 

Hence <P mx ^ = m x 2 m 2 - (m x m v f 

_(JV-«)P !!2 r 1 6 (n — 1)(iV — n — 1) "] 

L-^ — 1 (N- 1 ) (N - 2) ( N - 3)J 


l)(N — 2 ) (N 
iV(iV— n) (n — 1 ) (N — n — l)P»Poi 
n 3 (N— 1) (N - 2) (AT-37 
(lV-n)iVf 2 iV(n-l)(iV-n-l) n(iV-n)'] < 
»• [.(iV - 1 ) (iV - 2)(N — 3) (iV- 1 )® 




11. TAe Standard Deviations of (a) pn and pw, ( b ) p 2 2 > mi Samples from an 
Infinite Population with any given Distribution . 

(a) I had worked out this case for a limited sampled population but owing 
to the great length of the final result which I obtained in an unsimplified form 
I decided for the purposes of this paper to confine myself to the case when the 
sampled population is infinite. Even here the result is rather long, but it reduces 
considerably when the population follows the normal law. 

We have already expanded p Zi in sums of x 8 and y 8 . For the standard deviation 
of p%\) we must square this expression and find the mean values of the resulting 
sums. As we are taking N infinite, many of these sums will vanish when their 
mean values are taken, owing to the fact that S(X 8 ) and S(Y 8 ) vanish in the 
sampled population. Therefore, in the following expansions which are needed for 
pzi 2 , I have omitted these sums. Thus: 

[S (x*y.)] 2 - S (x*y 8 2 ) + 2 S (x*y 8 x t * y t \ 

[S {x 2 y 8 x t )J = S (x*y 2 x t 2 ) + 2 S (x 2 y 8 x t s y t ) + 2 S (x 2 y 8 x?y t x 2 ) + etc., 

[S (x 2 x t y t )] 2 = S (xfx?y?) + 2 S (x*y 8 x?y t ) + 2 S (x*x t y t x u y u ) 

+ 2 S (x z y 8 x t y t x 2 ) + 2 S (x 2 y 2 x t 2 x u 2 ) + 4 S (x 8 y 8 x t y t x u 2 x 2 ) } 

* This will be found to agree with Dr Neyman’s formula (59) in Biometrika , Vol. xvn. p. 478, on 
putting XslY. 
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[S (x,y a x t x tt )] 2 = S (x 2 y 2 x t 2 x u 2 ) + 28 (x 2 y a x t a y t x u 2 ) 

+ 2 S (x s y e x t y t x u 2 x 2 ) + etc., 

[S (x a y t )] 2 = S (x*y t 2 ) + 2 S {x a y,x t a y t ) + 2 S (x a x, a y u 2 ) + etc., 

[8 (x 2 x t y u )] 2 = 8 {x*x t 2 y 2 ) + 28 (x a x t 2 y,x u y u ) + 2 S (x*x,y,x u y u ) 

+ 2 S (x a x t a y 2 ) + 2S (x 2 y a x t 2 y t x 2 ) + 6 S (x 2 x t 2 x u 2 y 2 ) 
4- 4 8 (x a 2 x a x u y u x v y v ) + etc., 

[<S = S (x 2 x t 2 x 2 y 2 ) + 2 S (x 2 x t 2 x u y u x v y v ) + etc., 

8 (x a y,) S (x 2 y,x t ) = 8 {x*y a x t 2 y t ) + etc., 

8 (x a y s ) S (x?ix,y t )=S(x a y,x t y t ) + S (x*y 2 x t 2 ) + S (x a y,x t y t x u 2 ), 

8 (x a y a ) 8 (x a y t ) = S (x a y 2 x t a ) + etc., 

S (x 2 y,x t ) S (x 2 x t y t ) = 8 (x*y a x t 2 y t ) + S (x a y 2 x t a ) + S (x a x t 2 y t x u y u ) 

+ 2 S (x a y,x t a y t x u a ) + etc., 

8 (x a y,x t )S(x,y a x t x u ) = S (x a y t x t y t x 2 ) + etc., 

8 (x 2 y,x t ) S (x a y t ) = S (x a y,x,y t ) + S (x,*x t 2 y t 2 ) + S (x a x t 2 y t x u y u ) + etc., 

S (x 2 y,x t ) S (x 2 x t y u ) = S (x a y a x t y t x 2 ) + 28 (x 2 y 2 x t 2 x u 2 ) + etc., 
S(x 2 x t y t )8(x a y,x t x u ) = 8 (x a x t 2 y t x u y u ) + etc., 

S (,x 2 x t y t ) 8 (x a y t ) =S (x a x t y t 2 ) + S (x, i y a x t 2 y t ) + S {x, a x t 2 y t x u y u ) 

+ 8 (x, a x t y t 2 x u 2 ) + etc., 

(x 2 xtyt)S(x 2 x t y u ) => S (x a x t y t 2 x u 2 ) + 2 S (x 2 y a x t 2 y t x u 2 ) + etc., 

8(x a y a x t x u )S (x a y t ) = 28(x*x t y t x u y u ) + etc., 

8 (x, y^J S (x 2 x,y u ) = S (x a y,x t y t x 2 ) + 8 (x 2 x t y?x 2 ) + S (x a a x, 2 y,x u y u ) 

+ 4 S {x a y,x t y t x 2 x 2 ) + etc., 

8 (x a y t ) 8 (x 2 x t y u ) = S {x*x t 2 y u 2 ) + 8 (x a y,x t y t x u 2 ) + etc. 


To find the tnean values of these sums, it is necessary to express the corre¬ 
sponding sums in X and Y in terms of the product moments of the sampled 
population. However, it is only necessary in each case to consider the term with 
the highest degree in N as those of lower degree vanish when N is infinite. 


For example, 

Mean 8 (x a y,x t a y t ) = Lim S 

T . n (n — 1) N 2 Pai 2 — NPg» 

- Lim ~r=—f. -- 

= $7l(n-l)Psl , , 

Mean 8 (x, a y?x?) = n(n — 1) P^Pjo. 

Mean S(x a y,x t y t x „ a ) = n (n -1) (n- 2) P»P»Pii, 

Mean 8(x a 2 x t 2 x u y u x v y v )= J n (n - l)(n — 2) (n - 3) P»*Pn*, 

and similarly the means of the other suras may be written down and hence the 
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value of We have already found the mean of pa in (E) and when N is infinite 
(paY - ( ~[aW + 9 ( 2 m -3)* P*W + 6 0 (2n-3)PaP S oPu], 
where, as before, a « n 2 — 3n 4 3. 

Hence, from the equation 

it will be found, after some reduction, that * 

- a* (n - If (P m - Pax 3 - 6 P 41 P 21 - 2 P M Pao) 

4* (w — 1) (PfloPw 4“ QPi&PiQ 4* 6 PmPu 4” lOPg* 2 4* 6P40P22 
4- 6 (a — 2) PsoPsaPn 4- (a — 2) Pgo a Po2 4 9 (a — 2) Pai 2 Pao} 

4 9(a — 1) (2a — 3) 2 [PioPw 4" P31 2 “1“ (a — 2) (Pu^P^ 

4* 2PsiPaoPn 4- Pao 2 P22 4 (a — 3) Pao 2 Pn a )3 
4- 9 (a — 1) (a — 2) (a — 3)* {6PgoPaiPu 4 4PgoPaoPia 4" 5Pgi 2 Pao 
+ Pm 2 Pot + PwPfnPiQ 4-4P8iPao-Pn 4- Pn a P40 4 2Pgo 2 p22 
4 (a — 3) (Pao 3 Poa 4 7Pao 2 Pii 2 )} 

4 6 a (n — l) 2 ( 2 a — 3) (PriPii 4 PeaPao — 2PaiPao P 11 ) 

— 6 a (a — 1) (2a — 3) {Pgo P 12 4 4P 41 P 21 4 3P 32 Pao 
4 (a - 2) (4P*)Pa P u 4 PaoPaoPia 4 3Pa 2 Pao)l 

— 6a(a — l)(a — 2)(n — 3) (lOPgiPaoPu 4 P4oPoaPao 4 2Pu 2 Pio 4 3P2o 2 Paa) 
4 18 (w — 1) (a — 2) (a — 3) (2a - 3) (Pao P20 Pi 2 4 2Pgo P a P u 4 Pu 2 Pao) 

4 54 (a — 1 ) (a - 2 ) (a - 3) (Pao 3 Poa 4- 3Pao a Pn 2 ) 

— 9a (a - If (2a - 3? PrfPu 2 .(M). 

Corollary 1. For a normal population, the product moments above have the 
values: 

Pea = 15 (1 4 6P 2 ) cr z # ar y * Pa - 3 Ra x *<r Y , Pe 0 = lSo* 6 , 

Pea = 3(1 4 4P a ) cr^cry^, P 51 » 1 bRa^ay, P 22 = (1 4 2 i£ 2 ) ojr a <Ty 2 , 

P40 = 3 <ry 4 , P41 = Pai = Pia = -Paa— Pao s= 9 . 

On substitution of these values, o*^ reduces to the value 

v'rtx - 3fa ~y :t8trr! > [(6» 8 - 12n + 9) + 3 R* (9n* - 20n +13)]. 

Corollary 2 . Put X » F in the result, thus giving the standard deviation of 
the fourth moment in samples of a. I have written the result in terms of the s 
of the sampled population, thus: 

- «* (n — 1? [(& - 105) - 8/S, - (A - 3/ — 6 (ft - 3)] 

+ 16o* (n — 1) [(& —15) + (& — 3)* + 6 (fit — 3) + (n — 2) ft] 

+18 (n - 1) (2n - 3)* [</8, - 3)» + 2 (n + 1) (ft - 3)] 
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4 72 (n - 1) (n -2 )(n- 3) 2 [(& - 3) 4 2fr] 

4 12 a (n - l) 2 ( 2 n - 3) [(/3 4 - 15) - (ft - 3)] 
- 48a (n - 1) (2 n - 3) [& 4 (« - 2) ft] - 96a (n -1) (» - 2) (n - 3) (ft - 3) 
4 72 (« ~ 1) (n -2){n- 3) (2n-3)ft4 24n* (» -1) (4w 8 - 9« + 6 ) ...(15), 
where a = r? — 3 n *f 3. 


Corollary 3. When the sampled population is normal all the terms, except the 
last, vanish in the above result giving, in this case, 


ff*™, “ 24 ) (4ii a - J>n + 6) (, 8 

n* ( n — 1 (m — l) 2 


(b) Referring back to equation (8) giving the value of P&, we shall require the 
following results in the expansion of p^ 2 : 

[S(x 2 y 2 )J = s (*„V) + 2 S(x 2 y 2 x t 2 y t 2 ), 

[8 (x 2 y, 2 )] 2 = S (x,*y*) + 2 8 (x 2 y 2 x t 2 y t 2 ) + 28 (x*y?y 2 ) 

+ 2/S (x 2 y 2 x t 2 y 2 ) + 2 .S' {x 2 x t 2 y v *) 4 4 8 (x 2 x t 2 y 2 y 2 ), 
t'S ( x s 2 y,yt)f = 8 (x*y 2 y t 2 ) + 2S (x 2 y 2 x t 2 y t 2 ) 4 2 S (x 2 y s x?y t y 2 ) 4 etc., 
[S (x a y a x t y t )J=S(x 2 y 2 x t 2 y t 2 )+2S(x 2 y 2 x t y t x u y u )+C>S(x s y s x t y t x u y u x v y v ), 
[S ( x *?/* x ty u )f = 8 (x 2 y 2 x 2 y u 2 ) 4 23 (x 2 y, 2 x t y t x u y u ) + 2 8 (x 2 y a x t y t 2 x u y u ) 
4 2 /S (x 2 y t 2 x u y u x v y v ) + 2S (x 2 y s x t 2 y t y 2 ) 

+ 2-S {x,y 2 x t y t 2 x 2 ) 4 24/S (x a y,x t y t x u y v x v y v ) 4 eta, 

L'S(« 8 2 .y t y„)] 2 = 8(x*y t 2 y u 2 ) 4 28(x 2 y a x t 2 y t y 2 ) 4etc., 

[S (x s x t y„y v )Y = 8 {x 2 x t 2 y 2 y 2 ) + 2S (x a 2 y t 2 x u y u x v y v ) 

4 6 S(x a y s x t y t x u y u x v y v ) + etc., 

8 (x 2 y 2 ) 8 (x 2 y?) = 8 (x*y 2 y t 2 ) + S (x 2 y,W) 4 8 (x 2 y 2 x?y 2 ), 

8 Wy?) 8 {x?y,y t ) = S (x, 2 y, 2 x t 2 y t ) + etc., 

8 Wy?) 8 ( x a y,x t y t ) = 8 (x, 9 y, 3 x t y t ) + S (x 2 y 2 x t y t x u y u ), 

8 Wy?) 8 ( x » s y,yi) = 8 (x*y,yt 9 ) 4 8 (x a 2 y 9 x t 2 y t ) 4 28 (x 2 y,x?y t yS) 

+ 8(x, 2 y,x t 2 y u a ) 4 etc., 

8 Wy t 2 ) 8 (x,y,x t y t ) = 8 (x?y,x t y t 9 ) 4 S (x a 9 y,x t y t y 2 ) 4 8 {x,y 9 x t y t x 2 ) 

+ 8(x 2 y t 2 x u y u x v y v ), 

8 («.V) 8 (x a y,x t y u ) = £ (x*y t a x u y u ) + 8 (x, 2 y,x t y t 2 x u y v ) 4 etc., 

8 ( x ?y?) 8 (x 2 y t y u ) = S (x 2 y e x t 2 y u s ) 4 etc., 

8 (x t y,x t y t ) 8(x,y,x t y u ) = S(x, 2 y,x t y t 2 x u y u )4 etc., 

8(x,y,x t y t ) 8 (x, 2 y t y u ) = 8 (x, 2 x t y?x u y 2 ) 4 etc., 

8 WyiVt) 8 (x,y,x t y t )m 8(x, s y, 2 x t y t 2 ) 4 8(x, 2 y,x t y t 2 x u y u ) 4 etc., 
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8 (x, a y,y t ) 8 (x,y,x t y u ) = 8 (x, a y,x t y t yj) + 2 8 (x a y,*x t y t x u y u ) + etc., 

8 8 («,*y<y*>=$ Wy?x?y a )+ etc., 

^ ( x *y>yt) S (x,x t y u a ) = 5 (x a y,x t y t y a ) + etc., 

® WytVu) 8 (x.Xtyu) = 8 (x, a y t a x u y u ) + S (x.*y, a x t y t x u y u ) 

+ 8 (x, a y?x u y u x v y v ) + etc., 

8 (x, a y.y t ) 8 (x,x t y t *) = 8 (x, a y,x t y t a ) + 8 (x a y a x t y t ) + 8 (x?y,x t y?x u y u ) + etc., 

.(m 

The mean values of these sums may be written down as in the case (a). Thus, 
in repeated samples, 

Mean 2/S (x a y a x?y?) «n(n-1) P K 2 , 

Mean S(x a y t *)*=n(n-l)P m Pot, 

Mean 2«S (x a y?y a ) = n (n - 1) (n -2)P 40 Poa*, 

Mean 4/S (x, a x t a y u 2 y a ) = n (n — 1) (n — 2) (» — 3) P^Pm 2 , 

Mean 24/S (x,y,x t y,x u y u x„y v ) -n(n- 1)(k —2)(k- 3) P u 4 , 

and so on. 

Corollary 1 to the result (F) gives the value of p& when N is infinite, so that 

W = + (2k - 3)* (P M P M + 2 P n *?+ 2a (2k- 3) P* (P»P w +2P u a )]. 

where a = ?i 2 — 3n + 3. 

Hence, from the equation 

we obtain, after simplification, 

»V fti « a* (n -1) 2 [- a + d - 4 (t + i')] 

+ (k — 1) (2k — 3)® [9a + 16 (k — 2) 6 — 8 (2k - 3) c + 2 (k - 2) <7 — 8 (2k — 3) h 
— 2 (2k — 3) k + 8m + (n — 2) (q + q ') +1 + 8 (k — 2) (m + k')] 
+ 4a® (k — 1) [2a + (k - 2) (e + e') + 2 (k — 2)/+ 2y + ((+ V ) + 2m] 

+ 4 (k— 1) (k — 2) (k — 3)® [56 + 4 (»— 3) c + 6 (e + e')416/+4gr+13 (k—3) A. 

+ (k - 3) k + 4 (p+p') + £ (q + q') + 2r + 4 (m + u')] 
+ 36 (k — 1) (k — 2) (k — 3) [c + 4ft + A] 

+ 2a (k - If (2k - 3) [- 4ib-2g + 4 j+(l + /')] 

— 4a (k — 1) (2n — 3) [(k -2) (e + e) + 8 (k — 2) /+ 5 (i + »') 

+ (k - 2) (p + p ’)+(« + «')] 

+ 4 (k — 1) (k — 2) (k — 3) (2k — 3) [2 (e + e') +10/+ (p+p') + 2r] 

— 8a (k — 1) (k — 2) (n — 3) [46 + 2g + 3 (u + u')] .(N), 

where a = n® — 3k + 3, and a, b, c, etc., have the same values as in equation (21). 
This results from the fact that the same sums occur in the expressions for p M a 
and pu* 
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Corollary. For a normal sampled population, the above product moments have 
the values stated on pp. 249 and 254 giving, on reduction, 

. t fe [ ( 5 » + »» + i) - + g^l] ■ 


12. The Third and Fourth Moment Coefficients of the Distribution of pn in 
Samples from an Infinite Population . 

The third moment coefficient of jpu may be written in the form 
Pn M s = Mean value of ( p n — pn) 8 

« Pn 3 - (Pn) d - 3<r 2 p u Pn .( 18 ). 

We have already evaluated p n and o 2 Pny so that it remains to find pn 8 . This 
I have done by multiplying the sums in equation (2) by 

Pn = ~~jrS(x,y,)-^8(x,y t ), 

and finding the mean values of the resulting sums. These were as follows: 

8 (x,y,) S (x 2 y 2 ) = 8 (x 3 y 3 ) + 8 (x 2 y 2 x t y t ), 

8 (*,y.) S (x, 2 iI?) = 8 (x 2 y s y t 2 ) + 8 (x 2 x t y t 3 ) + 8 (x s 2 y t 2 x u y u ), 

S (x»y») 8 (x,y,x t y t ) = 8 (x 2 y 2 x t y t ) + 3 S (x,y, x t y t x u y u ), 

8 (x,y,) 8 (x 2 y s y t ) = S (x 2 y t x t y t 2 ) + etc., 

8 (x,y s ) S (x„x t yt 2 ) = S (x, 2 y s x t y t 2 ) + etc., 

<S (x s y t ) S (x 2 y t 2 ) = 8 (x 3 y, a ) + 8 (x 2 y,x t y t 2 ) + etc., 

8 (x»y t ) 8 (x,y,x t y t ) - 8 (x 2 y,x,y t 2 ) + etc., 

8 (x,y t ) 8 (x 2 y,y t ) = 8 (x 3 y a y t 2 ) + 8 (x 2 y 2 x t y t ) + etc., 

8 ( x,y t ) 8 (x,x t y t 2 ) = 8 (x,y 3 x t 2 ) + S (x 2 y 2 x,y t ) + etc. 

I have again omitted the sums which vanish when their means are taken. Now 
we have 

Mean S(x 3 y 3 ) = nP&, 

Mean S(x 2 y 2 x t y t ) = n(n-l)PtzPn, 

Mean <S (x a y,y t 2 ) » n (n - 1) P n P M , 

Mean S(x 2 x t y t a ) = n (n — l)PnPx>, 

Mean S(x 2 y t 2 x u y u ) «n (n - 1)(n- 2) P K PoaPn, 

Mean S(x,y,x t y t x u y u )=^n(n-l)(n- 2)(n-3)P u 8 , 

Mean 8(x e 2 y,x t y t 2 ) = n(n-l)P a P lt , 

Mean S (x t 3 y 3 ) = n(n — 1) P x Poa • 

Hence, using the values, when N is infinite, of 
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we may write, from equation (18), 

” 5 (p«^ 8 )* (n-l)3[6 + (n-l)c] + (M^l) a [d + 6 + ^-~ 2)/3 

+ a (n — l ) 2 [2c + (n — 2 ) ^r] — 4 (n — l ) 8 A — (n — 1 ) (i + A) — 2a (n — 1) A 
+ 2 (n -l ) 2 (2c + d + e) - 4 (n -1) (n - 2) / 4 - 2 (n -1) (n - 2)* (/ 4 * g) 
-n*(n-l)*g-3n(n-inf+(n-l)c-(n-2)gl 
where b = -P 33 , cssPu^M) ^ ssa PoaPsii 

c = Pao-Pi 8 , / = PnPsoP<> 2 » # = Pu 8 > 

A » P 21 P 12 > i * Pso P 03 > a*»(n — 1 )*+ 1 . 

I simplified the above by collecting terms in n 8 , w 2 , n, and the constant (the 
terms in w 6 and n 4 vanishing) and finally obtained 

Pn^3 5=5 ^(Pas — 3 P 2 aPu + 2 Pn 8 — 6 P 21 P 12 ) 

71 

- i<3P„- 15P*2Pn -3P a P M -3P U P«, + ^PuPjoPo* + 14P U »- 18P a P u ) 

71 

•+• “4 (3P83— 2 IP 22 P 11 —6PsiPoa— 6P 13 P 20 4" 30Pup2oPo2“b 24Pn 8 — 2 IP 21 P 12 —PsoPm) 
n 

- i(Pj 3 -9P a Pu- 3P n P M - 3PisP*o + 18PuP*oPo*+12Pu , -9PnP U -PaoPos) 

.(19)*. 

The case of the fourth moment coefficient, although somewhat longer to work 
out, is simpler than the previous case, owing to the fact that the expressions 
involved are symmetrical and that the sums occurring in P 2 & are the same as those 
used to find pu 4 , so that their mean values have been found. We now write 

6(fri?<r*p ll -(puY‘ .( 20 ), 

so that we have to evaluate pu* t as the other terms are now known. 

The value of pu a in equation ( 2 ) has now to be squared, and the values of the 
squares and products of the sums are given in equation (17); their mean values 
are also known. Thus Mi may be written down from equation ( 20 ). To shorten 
the writing I have put the result in the form 

- 3w« (a — 2b + c) 

+ n s {-12a + 366-21c + d + 12(e + e') + 72/ 

4 - 6g — 6A —12 (i + i') — 4 j\ 

* This agrees with the result of Dr Church in Biometrika , Vol. xvn. p. 82, giving the third moment 
coefficient of the distribution of variance in samples from an infinite population, when we put X t =*Y 9 in 
the above equation. * 
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+ n a 130a - 1506 + 108c - 3d - 60 (e + e') - 288/- 48# + 162A + 36 (i+i) 

+ 24j + 9A:+6(i + 0 + 12m~48(u + tt')-12(jp + j p')| 

+ n {- 36a+ 2526 - 216c + 3d +108 (e + e) + 468/+ 1(% - 504A, 

- 48 (i + %) - 36jf - 4 ok -12 (i + 0 - 24 m + 132 (ti + O + 48 (p +p') 

+ 3 (q + j') + 20r - 4s} 

+ {18a -1446 + 144c - d - 72 (c + a') - 288/- 72^ + 432/t + 24 (t + i) 

+16^' + 54i +6 (l + l')+ 16m — 96 (u + u') — 48 (p + p') 

- 6(3 +/)-32r + 4s + £} .(21)*, 

where a = P/, P n 2 P&, c = P u 4 , d = Pu, 

e = P/P 02 , f= PziPnPiii 9 — P 22 Pm P 02 > h — P u 2 Pao P 02 , 

1 = P21i 3 28 J j~ P33P1I} k = p 2 Q 2 Pm 2 > l = PtoPoii 

m = P31P18, W = P31P11P 0 2> P = P 30 P<)2 Pl2 1 ( l = p20 2 P(H > 

^ = p 3 oPo 3 Pll> S = P41 P03, ^ = P40 P() 4 ■ 

The dashed letters denote the functions obtained by interchanging X and F; so 
that e' = P 12 Pm > i' = PnPa 2 > and so on. 

13. The @1 and (3% of the Distribution of pu in Samples from an Infinite Normal 
Population f. 

The results (A) and (B) and equations (19) and ( 21 ) give the first four moment 
coefficients of the distribution of pu in samples from a population not necessarily 
normal. 

Make N-+*cc in (A) and (B) and substitute the normal values of the product 
moments in the sampled population, which are as follows: 

P 22 - (1 + 2P 2 ) <r A W, P 33 = 3 R (3 + 2P 2 ) a/a y 3 , P 3 1 - 3Per/a y , 

P 44 - 3 (3 + 24P 2 + 8P 4 ) a/a/, P 42 = 3 (1 + 4P 2 ) a/a y 2 , P 40 = 3a/. 

The remaining moments, P a& , in which a + 6 is odd, vanish in this case. 

The four moments may now be written: 

= (~) R< T x ff Y> 

pu **- (^-)(i + ^V*W. 

Pa Mi = 3 (ff ) {rc (1 + R*y + (1 + 10i2 2 + R*)\ <r*V/.(22). 

* Transforming the result above to the ease of one variate, it will be found to agree with the value 
of given by Dr Church in Biometriha , Yol. xvn. p. 88. 

t See Wishart, op. cit. p. 42. 
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Hence 


ft' 


A- 


Jf 8 *_ 4.R*(.R* + 3)* 
37? (n—l)(ii*+1)*’ 


if.* 


= 3 + 


6(l + 6fl* + .R«) 


.(23), 


(n-l)0K* + l)*j 

The values of ft(7i-l) and (>Sa — 3)(n — 1) have been plotted for different 
values of R and the graphs are shown in Figures I and II respectively. For a given 
value of R y ft and (ft — 3) are found by dividing the ordinates by (n — 1). 


The result of eliminating (n — 1) from equations (23) gives 

3ft (1 4* R 2 ) (1 + 62i 2 + R*) - 2ft R* (3 + R*)* + 6R*(3 + R 2 ) 2 - 0 .. .(24). 

Thus, for any given R, the point (ft, ft) lies on a straight line passing through 
the Gaussian point ft =0, ft » 3. 


When U = 0, the line becomes ft = 0, and when 22 «1, (ft,ft) satisfies the 
condition for the Pearson Type III curve, viz. 2ft — 3ft — 6 = 0. 

From liquations (23) it is seen that as n-*-oo, ft-*0 and ft-*-3, the normal 
values. 


Waen R is eliminated from Equations (23), we obtain 

(<f> - 2) 8 + (20 - 3<f> + 2? « 0 .(25), 

where 6 = 1 j ft > <t> = g ~ ^ (ft - 3). 

Thus, for any given size n of the sample, (ft, ft) lies on the cubic curve (25). 

The system of straight lines (24) and the system of cubics (25) are shown in 
Figure III, which enables the values of (ft, ft) to be read off from given values of R 
and n. The rapid approach to normality as n increases is seen from this figure, and 
for high values of the correlation R f (ft, ft) lies very close to the Type III line, 
2 ft - 3 ft -6 = 0 . 


The main Pearson Types covered by the points in Figure III are IV and VI, and 
Types III and VII arc obtained for the special cases of R = 1 and 0 respectively. 
For the particular case of n = 3 and R = 1 Equations (23) give ft = 4, ft = 9, that 
is: the Exponential Point in the (ft, ft) plane. 


14. Summary . Finally, to give some indication of the nature of the formulae 
provided in this paper, I have summarised them in the manner shown below. 

I. General values for any distribution and for a limited sampled population 
have been found for: 

(а) Mean values of p n , p 12 , p M , p p 18 and consequently of m*, 7 %, 7714 . 

(б) Standard Deviations of p u , p 12) p n and consequently of m 2 and 7n 8 . 

(c) Correlations between m wt m y > <r x 2 , <r„ a , p n * 

Of the above the univariate results for 7 * 1 *, m 2 > m 4 , <r 2 Mt and r mXt<rx * are easy to 
compute in any given case, while a 2 m$ (see Corollary 4, p. 239), although rather 
long, does not present an insuperable amount of arithmetic labour to use in practice. 
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The bivariate results in this general case have a theoretical interest and may 
be used, for example, in the determination of certain bivariate distributions. 

II. General values for any distribution and an infinite sampled population have 
been found for: 

(a) The standard deviations of p 31 , p n , p 22 and consequently of m 4 . 

The result for (p. 249) has been given in terms of the s of the sampled 
population and is quite easy to compute. 

(b) The third and fourth moment coefficients of p n and consequently of m 2 
(or s 2 ). These results, already obtained by Dr Church, arc quite easy to compute. 

(c) The univariate results in I. These are correspondingly easier to compute 
when N oo. 

III. Values for an infinite normal sampled population have been found for: 

(a) All the results given in I and II. These reduce to very simple forms, the 
univariate results involving only the size of the sample ( n ) and the standard 
deviation (a) of the sampled population, and the bivariate results involving w, the 
correlation R of the sampled population and a x > *y the standard deviations of 
X and Y. These results are all easy to compute. 

(b) The and fi 2 of the distribution of pn. These are simple expressions in 
n and R (Equations (23)) and have been computed for different values of n and R, 
illustrated in Figures I, II and III. 

In conclusion, I wish to thank Professor Karl Pearson for suggesting the sub¬ 
ject of this paper to me, and for his invaluable advice and kindly encouragement 
throughout its progress; also Miss Ida McLearn for her preparation of the diagrams. 
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1. The Use of “Student’s” a-Test with Populations not Normal. 

One of the most important problems with which the mathematical statistician is 
faced is that of bringing his theoretical structures into some degree of correspondence 
with the situations of practical experience. This is no doubt hardest when the 
samples are small, for here he will often be faced with two difficulties. In the first 
place his populations may not be completely stable; the sample when drawn may 
be a random one, but owing to difficulties in control or to some changing time 
factor he cannot be sure that it will be quite the same population with which he 
will be concerned in further work And then, even if he is sure of the stability of 
his population, it will generally be impossible for him to obtain any certain estimate 
of its exact form. For purposes of inference he may calculate from the sample one 
or more statistical measures, but the first difficulty will make him hesitate to lay 
too much stress on the exact value of the figures found on entering his probability 
table, while the second may make him wonder whether there is after all any 
appropriate table in existence. 

The questions of stability and randomness of sampling can only be dealt with in 
each problem as it arises, but though the statistician may be prepared to accept these 
conditions as approximately true, he is still faced with the second problem. “The 
majority of tests dealing with small samples,” he may say, “have only been worked 
out for the case in which the population distributions are normal. I do not know 
whether my distribution is normal, although from my general experience in the 
past I do not think that it is likely to be excessively skew or leptokurtic. How 
sensitive are the ‘normal theory’ tests to changes in population form ? May I use 
some with less hesitation than others ?” 
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In the present paper an attempt will be made to answer this question as far as 
it concerns some of the tests connected with “Student’s” Type VII distribution of 
z t the two fundamental tests considered being those dealing with the mean of a 
single sample and the difference between the means of two samples. It may be 
well to illustrate the problem by taking a concrete example. A commercial firm, 
let us suppose, is considering whether to introduce a new method of production, 
which may be of advantage perhaps either through a saving of time or because it 
seems likely to lead to an improvement in the quality of the article produced. 
A series of experiments is carried out in which some variable quantity x is 
carefully observed under both methods. As a result two samples are available, one 
of n x values of x with a mean x x and standard deviation s Xf the other of n 2 with x 2 
and 8 t . In this case the most useful answer that statistical analysis could give 
would perhaps be as follows: “The exact difference that would be found to hold 
in the long run between the average values of x arising from the two methods 
cannot of course be determined, but the odds are Tc to 1 that this difference lies 
between dj and d 2 ” With such an answer as this before it the firm could decide 
whether the innovation showed an improvement of sufficient significance to be 
commercially profitable, or if the question remained in doubt whether it seemed 
worth the expense of undertaking further experiments in order to narrow down 
these limits, d x and d 2 . But unfortunately an answer in so precise a form cannot be 
given without serious assumptions which at once destroy the precision. 

If the samples are small we may use R. A. Fishers two-sample 2 -test* and 
calculate 

z m fr'i — % 2 — d j n x n 2 yjv 

n x s^ + n 2 s 2 2 * n i + ^2 


Then choosing a suitable value of a, such as ‘01, and entering “Students” tables, 
we may find the values of d x and d* corresponding to z = ± z a for which P z =* 2a, 
where 


Pz= 2 



n x + Tig - 1 
2 


dz 


( 2 ). 


On the assumption that the two distributions of x are normal and have the same 
variance, we can then say that if the difference between the population mean values 
of x were (1) as low as d X) or (2) as high as d 2t then the chance would be a of 
obtaining in pairs of random samples of n x and n 2 : (1) a positive deviation of z y or 
(2) a negative deviation of z as great or greater than that observed. But it is not 
possible to speak in any exact sense of the odds being 1 - 2 a to 2 a that the difference 
in means lies between and d*. Such a use of the inverse probability would 
involve an assumption regarding the & priori probability distributions of the 

* Metron , 1925, Vol. v. No. 3, p. 7; Statistical Methods for Research Workers, 1928, p. 107. 

The relation of this and the single sample 2 -test to the criterion of likelihood was disousBed by 
Neyman and Pearson in Biometrika, Vol. xx A . pp. 190 and 207. The symbol 2 will be used throughout 
this paper. 41 Student's 1 ’ later tables in Metron , Vol. v. are entered with t - 1 , and the ^-notation is 

that used by Fisher. In equation (2) c, has the value T (i (j^ + n* - l))/{r (i (m+nj - 2)) 
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population means and standard deviations; and further in the case where it is not 
even certain that these populations are exactly normal a more complex A priori 
assumption still must be introduced, so that any approach to an exact solution in 
terms of inverse probability becomes impossible. The difficulty and one methdd of 
treating it was discussed briefly by “Sophister” in the last number of this Journal 111 
in dealing with the distribution of z found on sampling from a skew population. 
With the fuller results now available it will be possible to analyse the situation a 
little more in detail than he was able to do last year. 

It is true that it may be more helpful for the practical worker to look at his 
problem from the inverse point of view, and to obtain some measure of the odds 
for or against the population parameters lying within certain limits. But a little 
reflection suggests that unless he is prepared to grapple with A priori probability, 
his justification in the use of any such rough and ready guide must depend on 
the validity of employing the probability tables of the ^-distribution in dealing with 
the following questions: 

(a) There is a sample of n individuals measured for a certain character. We 
wish to test the probability of the hypothesis that this sample has been drawn from 
a population whose mean is at a distance m = x — a from the sample mean x. 

(b) There are two samples of n x and w a , and on the assumption that they come 
from populations with the same variance, we wish to test the hypothesis that the 
means in these populations differ by df (d of course will often be zero). 

In discussing the various experimental results we shall therefore be concerned 
chiefly with the adequacy of the “normal theory " in testing these two fundamental 
hypotheses. There are many cases in which the problem presents itself in almost 
exactly one or other of these forms, but the reader should find no difficulty in inter¬ 
preting the tables given in any manner which seems more applicable to the particular 
form of problem with which he is concerned. 

In practice an hypothesis will be accepted or rejected with a varying degree of 
confidence; no precise line between acceptance or rejection can be drawn. Yet 
some light is thrown upon the problem if it is supposed for the moment that this 
vague edge of uncertainty can be given precision, the statistician being compelled to 
make a definite decision one way or the other; he will reject the hypothesis when 
P z < 2a { and accept it when P z > 2a, where the value of a used will depend upon the 
nature of his problem. If this be so errors in judgment cannot be avoided, and it 
is seen that they will be of two kinds: 

(1) The hypothesis is rejected when it is in fact true. 

(2) It is accepted when it is false. 

* Biometrika , Vol. xx 4 . p. 421. 

t The question of finding the probable limits d x and d* in the commercial problem suggested above 
seems really to consist at bottom in testing the seoond of these hypotheses for varying values of d. We 
can then get a good appreciation of these limits without attempting to assign numerical odds to the 
chance that the difference in population mean lies between them. 

$ In the oase of two samples P t is as in equation (2) above. For the single sample hypothesis 

JP,=9J Co(l+**) * dt, where c,=r(i«)/{r(l(n-l)) 
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It is impossible to estimate the relative proportions in which these two types 
of error occur, as it will depend upon the kind of problems to which the statistician 
applies his tests, but we can analyse separately each type. 

If the populations sampled are all of a form such that z follows “Student's” 
distribution, then the source of error (1) may be completely controlled. In the long 
run such errors will be committed in 100 x 2a °/ D of the cases in which the 
hypothesis tested was really true, and the statistician may choose a according to 
the risk he is prepared to take of making this form of error of judgment. From this 
point of view, ] z |* is as good a criterion as z . Suppose that ± z* are the deviations 




Fig. 1. 


corresponding to tail areas of a when “Student's'’ tables are entered, but that 
in fact, as the population is not normal, the distribution of z in repeated samples 
follows a certain skew curve. The tail areas beyond ± z a are now a x and a 2 which 
are not equal. But if we know that a t -f a 2 = 2a (or very nearly so) and this for a 
wide range of values of af, then our control of the first source of error will be as 
good in sampling from the non-normal population as from the normal one. 

We shall therefore first consider below how far \z\ follows “Student's” 
distribution in samples from a variety of non-normal populations J. We may note 
here, incidentally, that any other statistical constant, z\ for which the sampling 

* The expression | z | indicates that the numerical value of z is to be given a positive sign. 

f Say between a = *100 and *005. 

X Previous experimental work in this direction has been carried out by Shewart and Winters, 
Journal of the A merican Statistical Association , Vol. xxm. pp. 144—53; Neyman and Pearson, Biometrika , 
Vol. xx A . pp. 197—207, using Church’s sampling data; "Sophister,” Biometrika , Vol. xx A . pp. 408—21, 
and Rider in the present volume, pp. 124—43. 
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distribution of | z' | is as invariable for changing populations as | z |, will be of equal 
value as a criterion in so far as the control of source of error (1 ) is concerned L 

When, however, we consider the second type of error, the position is somewhat 
different. It will not generally in practice be of serious consequence if we accept 
the hypothesis tested when in fact the moan of the sampled population differs by 
some small quantity, t, from the supposed value a ; nor in the case of two samples, 
if the population means differ by d + t instead of by d . This cannot be avoided, 
but we should like to have some appreciation of the rapidity with which the untrue 
hypotheses are rejected as t, or rather the ratio of r to the population standard 
deviation, increases. If we accept the hypothesis when P z > 2a, are we likely to be 
doing so when really the true population mean is at a distance of o* or perhaps 
even 2<r from its supposed position ? Or in the second test, when the means of 
the two populations really differ by d 4- cr or even d 4- 2<r ? We are concerned now 
with what may be termed the sensitivity of the test in the rejection of false 
hypotheses, and this will depend upon (a) the size of the sample, (h) the form of 
the population sampled, and (c) the sign of t. It has been pointed out that in 
testing any given hypothesis there will be an indefinite number of criteria which 
will ensure the control of the first source of error, but it seems probable that for 
each type of population there will be one of these which is more efficient than 
any other in controlling the second error. This point will be examined in more 
detail below in connection with the experimental results, and the sensitivity of 
“ Student’s ” z and of the ratio z — sample centre/(half sample range) * will be 
compared for samples of 5 and 10 from a variety of populations. 

2. The Populations Sampled. 

No experimental programme could possibly cover all the populations likely to 
be met in common experience, but a variety of types of frequency form have been 
represented by taking samplings from Pearson-curves of the following nature: 

TABLE I. 


Population 

Curve 

pi and p 2 °f 
grouped distribution 

Samples 

S.D. of population in 
terms of grouping unit 

Type II 

0, 2*50 

( 1000 of 2 

63*25 



500 of 5, 500 of 10 

6*32 



500 of 20 

10*54 

Type VII 

0, 4-12 1 

[ 1000 of 2 

56*67 


i 

1 500 of 5, 500 of 10, 500 of 20 

5*67 

Type VII 

0, 7 07 

f 1000 of 2 

64*48 



(1000 of 5, 600 of 10, 600 of 20 

6*45 

Type III 

•20, 3-30 

( 1000 of 2 

5000 



1000 of 6, 600 of 10 

6*00 



600 of 20 

6*67 

Typo III 

•60,3-73 | 

\ 1000 of 2 

5000 

1 

11000 of 6, 600 of 10, 1000 of 20 

6*00 


* The use of the “centre” or mid-point between extreme observations in the sample as an estimate 
of the population mean was discussed in Biometrika , Yol. xx A . pp. 212 and 358. 
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The sampling was carried out with the help of Tippett's Random Numbers*. 
Certain results obtained from the sampling of the three symmetrical populations 
have already been published f. The saihples of 5 and 20 from the skewer of the 
two Type III populations are those obtained by “ Sophister ” $, who has kindly 
placed his data unreservedly at our disposal. To these we have added samples of 
2 and of 10, and have taken a fresh Type III population to fill in the gap between 
his population and the normal population. 

In drawing samples of 2 from a grouped population distribution, both indi¬ 
vidual values will occasionally fall in the same group so that the value of z becomes 
indeterminate. And even if the values fall into groups, one or two units apart, 
considerable uncertainty must exist as to the true value of z if it be supposed that 
the population distribution is really continuous. By taking a very fine grouping, 
i.e. 50 or more groups to the population standard deviation, this difficulty was 
reduced to a minimum. Only about 10 cases occurred in the 5000 samples of 2 
in which both individuals fell in the same group ; these cases were discarded and 
fresh samples taken, and it was assumed that no serious systematic error would 
arise in other cases if z were calculated on the assumption that the variates had 
mid-group values. 

TABLE II (a). 

Frequencies of z in 1000 Samples of 2. 


1 r 




Populations Sampled 



1 ~ 1 

greater than 

ft = 0-00 
02 = 1-80 

0! = O*OO 
02 = 2*50 

ft = 0-00 
ft=8-00 

01 = 0*00 

02 = 4* 12 


0 1 = O*2O 

02 = 3*30 

01 = 0*50 

02 = 3*73 

0*0 

1OOO-0 

1000 

JOOO-O 

1000 

1000 

1000 

1000 

0-5 

660-7 

682*5 

704-8 

716*5 

723*5 

731*5 

699 

1*0 

500-0 

487 

500-0 

512 

526 

609*6 

487*6 

1*5 

400-0 

373 

.774-4 

374*5 

372*5 

376-5 

355 


888‘3 

308*5 

295-2 

296*5 

272 

290-5 

280*5 


285-7 

240*5 

242'2 

233*5 

204 

226 

231*5 


250-0 


204-8 

199*5 

173 

188 

196 


222‘2 

193 

177-2 

170'5 

147 

152*5 

168*5 

4*0 

200-0 

178 

156-0 

156 

131 

134 

160 

4*5 

181:8 

162 

189-4 

147 

113 

119*6 

1 126 

.VO 

166-7 

145 

125-6 

132 

109 


1 116 

6*0 

142-8 

119*5 

105-2 

112 

97 

84 

101 

7*0 

125-0 

09 

90-4 

100 

88 

74*5 

88*5 

8*0 

111-1 

90*5 

79-2 

87*5 

76 

67*5 

78*5 

0*0 

100-0 

80 

70'4 

79 

68 

64 

71 

10*0 

90-9 


68-4 

69*6 

62*5 

57 

6b 

15-0 

62-5 

50 

W'4 

42 

42 

39-5 

* 36 

20*0 

47-6 

31*5 

81-8 

30 

29*5 

28 

26*5 



— 

*137 

_ 

*684 

•006 

*394 

•871 

Goodness 

' ' \n 

*— 

17 

— 

17 

17 

17 

17 

of Fit 

* \ p , 

— 


_ 

_ 

. 

*242 

*022 


w 



1 

— 

— 

26 

26 


* Tract* for Computers % No. xv. A fresh sampling was carried out for each of the 20 sets, and the 
columns of the sampling book and the number scale were frequently altered so as to ensure as far as 
possible complete independence between the different sets, 
t Biometriha , Vol. xx A . pp. 856—60. 

$ Biometriha , Vol. xx A . pp. 889—423. 
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3. The Single Sample Test. 

Tables II (a), (6), (c) and ( d ) give the results of the sampling. In the first 
place they show the number of samples in which | ar | lay beyond the limit given 
in the leading column. The figures in italics are theoretical values, the others 


TABLE II (6). 

Frequencies of z in 500 Samples of 5. 



experimental. Thus to take Table II (a), we find among 1000 samples of 2 the 
following numbers having | z | > 5*0, that is to say with z outside the limits - 5*0 


and 4- 5*0: 


* Figures in these oolamns reduced from results for 1000 samples. 

f Standard error of Mean & for 1000 samples from a normal population. The standard error of <r f 
would be theoretically infinite were the sampled population truly normal. 

X This value of cr g has been calculated, omitting one very divergent value of t of - 8*5; including it 
<r.= *8198. 
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Small Samples from non-normal Populations 


Rectangular Population, 166*7 (theory)*; Symmetrical Platykurtic Population 
(#2 = 2*5), 145; Normal Population, 125*6 (value from “Student’s” tables); Sym¬ 
metrical Leptokurtic Population (/8 2 = 4*1), 132; etc. 

TABLE II ( c ). 

Frequencies of z in 500 Samples of 10. 



The last columns of Tables II (6), (c) and ( d ) give the corresponding frequencies 
obtained on the assumption that the distribution of z is normal, with a standard 
deviation of 1/Vn —3 or 1 /^2, 1 /V 7 and 1 /Vl 7 respectively. 


* For samples of two, z is the same as z\ or the ratio, centre/(4 range), for which the distribution 
II = 4 (1 + 1 z' |)~ 2 was given in lHometrika , Vol. xx A . p. 211. 

f Figures in this column reduced from results for 1000 samples. 

X Standard errors of Mean z and <r t for 500 samples from a normal population. 
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In the lower part of Tables II (a), (6), (c) and ( d) are given: 

(1) The result of applying the (P, x 2 ) test for Goodness of Fit, the theoretical 
distribution being in each case that of the Type VII 2 -curve of “ normal theory/' 
In the test for | z |, corresponding positive and negative values of z have been 
combined, or the 2 -curve was doubled over about 2 = 0 ; in that for z the positive 

TABLE II (d). 


Frequencies of z in 500 Samples of 20. 






Populations Sampled 



Normal 

1 r 








distribution 
with S.D. 

= 1/^17 

1 - I 

greater than 

fa =0-00 

j9,=0-00 

Pi =0-00 

ft = 0-00 

01=0-20 

ft=0*50 



08=2-50 

02=8-00 

02=4-12 

02=7-07 

ft=8-80 

02=8-78 


500 

noo-o 




600* 


•05 

415 

m-o 

402 

409 

412 

413 

418-3 

•10 

333 

333-9 

324 

317 

323 

332*5 

^§gf|; 

•15 

253 

200-5 

251 

244 

240 

261 

268-1 

•20 

182 

197-1 

196 

176 

194 

198*5 

20k'8 

•25 

136 

m-7 

147 

125 

145 

150 

151-4 

•30 

101 

103’8 


84 

98 

105 

108-0 

•35 

69 

71-8 


54 

65 

74*5 

74-5 

•40 

46 

ll8-7 

51 

38 

52 

57 

49-6 

•45 

35 

32-8 

32 

21 

36 

37-5 

31-8 

•50 

26 

21-1 

20 

11 

25 

26 

19-6 

•55 

20 

13*5 

15 

6 

12 

16-5 

11-7 


13 

8-5 

10 

1 

11 


6-7 

•05 


5-3 

3 

1 

7 

7 

3-7 

•70 


3-3 


1 

5 

6 



\-\\ p 

•535 

_ 


•487 

•172 

•797 

_ 

Goodness 


11 

— 


11 

11 

13 

— 

of Fit 

, \ p 

___ 

. 

■ 

, 

•022 

•049 

— 



— 

— 

XH9B Mi 

— 

20 

20 

— 

Mean z 

+ *0006 

0 

s.e. -om t 

-•0056 

+ •0135 

-•0217 

-•0223 

— 


•2439 

S.E. -0084+ 

•2409 

•2187 

•2436 

•2494 

— 


and negative values were kept separate, n' is the number of groups used in 
applying the test. 

(2) For samples of 5, 10 and 20, Mean z and <t> are given The results have 
been represented graphically in Figures 2 (a), (6), (c) and (d). The continuous 

* Figures in this column reduced from results for 1000 samples. 

t Standard errors of Mean z and <r, for 500 samples from a normal population. 

t Here and throughout this paper the standard error of a standard deviation has been taken as 

, where a and ft are the constants of the theoretical distribution of the variable, and N is 
the number of samples upon which the value of the standard deviation has been based. 

























Scale of |z| 


Egon 8. Pearson 


269 


9oU«rfP z 



0*20 0-40 0*60 0*80 1*00 1*20 

Scale Of |z| 

Fig. 2 (6). 



Probability Distribution of |z|: samples of to 

-For Students Type VII Curue 
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curve shows the change in the P z of “normal theory” as \z \ increases, and the 
upper and lower limits of the shaded belts represent the highest and lowest 
observed frequencies of the corresponding row of Table II when divided by 500 or 
1000 as the case may be. To give some indication of the sampling variation that 
might be expected to arise at different points on the 2 -scale if the true distribution 

Sc&le of |z| 



followed “Student’s” curve, lengths equal to twice the standard error of these 
reduced frequencies have been plotted on either side of the continuous curve*. 
But of course the systematic manner in which the frequencies differ in certain 
cases from “normal theory,” as shown in the tables, makes it clear that the width 
of the shaded belt has at any rate sometimes a real significance. Owing to the 

# If P t is as defined in the footnote to p. 261 above, then the standard error of the proportion of 
iV samples in whioh | * | is greater than a certain value is 


Scale of P- 
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inevitable sampling fluctuations too much stress cannot be laid on any single 
difference taken alone, but in combination the results support one another. We 
shall consider them briefly in order. 

Samples of 2. 

Taking the symmetrical populations, the progressive change in the frequencies 
corresponding to a given value of \z\ as we pass from the rectangular population 
to the most leptokurtic population ($ 2 =7*07) is very marked. The changes are 
Jess clear in passing from the normal population down the Type III line; this js 
the result of doubling over a skew ^-distribution, but for reasons given above 
we shall be content in the present paper with considering the distribution of | z | 
only *. 

Samples of 5. 

The table and diagram show that a normal curve with er z = ljs/2 provides a 
very poor approximation to “Students” curve. For the symmetrical populations, 
a z diminishes steadily as the population $ 2 increases, and the tail frequencies in 
the 1000 samples of 5 from population ($ 2 = 7’07) are quite clearly less than those 
expected on normal theory. The correspondence for the other four populations is 
really very good. The distribution of z (not doubled over) is, however, very skew 
in the case of the population (& = *50, $ 2 = 3*73). This is u Sophister’s ” case and 
has been fully discussed by him. } 

9 

V) 

Samples of 10. ' j* 

Figure 2 (c) shows that the line representing the tail area of the normal c 'e 
with <r*=l/V7, although still differing rather widely from the line representing 
“ Students” curve, now falls largely within the shaded belt. For | z | > *6 the 
difference between the two curves is not great. For the samples from the three 
symmetrical curves, <r z lies very close to the “normal theory” value, and the corre¬ 
spondence in frequencies is good. For samples from population (^ = 0, $ 2 = 7*07) 
there is curiously no evidence of the shortage of frequency in the tail which appears 
for samples of 2, 5 and 20. For the two Type III populations there is an excess of 
high values of \ z\ which shows itself in the upper limit of the belt in the diagram. 
This tendency is not at all evident in the 1000 samples from Church's population 
(ySx = *22, $2 = 3*16), but great caution must be exercised in drawing conclusions 
from apparent differences in these cumulative frequency distributions. If we take 
the distribution of \z\, (a) for the 500 samples from the Type III population 
{/3 t = *20, $ a * 3*30), and (b) for the 1000 samples from Church's population ($ x — *22, 
$ 2 = 316), and apply the (P, x 2 ) difference test, we obtain for 11 groups a P of 
*592; that is to say the observed differences which look large in the columns of 
Table II (c) are not inconsistent with a common theoretical law of distribution 
for | z |. 

* That the distribution of z for samples from the skewest population is also skew, is shown by the 
drop in value of P from *871 (test for | z ]) to *022 (test for z) in the goodness of fit tests. 
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The distribution of z (not doubled over) for samples from “ Sophister’s ” popula¬ 
tion (&= *50, St * 3*73) is definitely negatively skew. 

Samples of 20. 

Figure 2 ( d ) suggests that we have now reached a size of sample where the 
normal curve with standard deviation l/>/n-3 represents the ^-distribution very 
well. The observed distributions of | z | are quite closely represented by “ normal 
theory” in all cases except that of the extreme leptokurtic population. Here 
there is again a shortage of high values of \z\, although judged by the test of 
goodness of fit this difference is not exceptional. For the Type III populations 
the distributions of z are again skew. 

A completely satisfactory analysis of the position will only be possible when the 
theoretical distribution of z in samples from any non-normal population has been 
found. But in the meantime these results enable a good appreciation to be formed 
of the extent of variation from “ normal theory ” that may be expected in sampling 
from a fairly wide variety of populations*. They suggest that within this range 
there will not in practice be a danger of any serious loss of control of the source of 
error (1), if \z\ be assumed to follow “Students” law. The least satisfactory 
agreement occurs among the samples from the very leptokurtic population (# 2 “ 7*1). 
Taken together, we find that the 21 tests of goodness of fit for \z \ give a mean 
value of P of *463; even if the variations were all due to chance we should only 
expect a value of *500. 

In Table III a comparison is made at about the level P z = ’04 of the chances, 
theoretical and observed, of obtaining | z | greater than the values indicated in the 

TABLE III. 


Comparative values of P z near *04. 



Population 

Normal 

distribution 

with 

S.D.= 1 / n /^8 

n 

■ 

ft=0*00 
0,=2*60 



/3, =0*00 
( 83 = 7*07 

ft=0*20 

ft=8*30 

ft = 0*60 
ft = 8*78 

2 

H 

■I 

•042 

•042 

mm 

mm 

•036 


5 



-040 

•038 


■II 

•042 

-034 

10 

■a 


• 040 

•038 

HU 

■MB 

*062 

-034 

20 

0*5 

KB 

•0J& 

•040 

KB 

•050 

•052 

-039 


2nd column. The observed frequencies have been divided by 500 or 1000 according 
to the number of samples, and the figures are therefore subject to sampling errors. 
But even if they represented the true values of P z in sampling from the corre¬ 
sponding populations, the differences between them and the “normal theory” 

* Certain incomplete results suggest that the population skewness cannot be increased much further 
without beginning to modify the distribution of | z | appreciably, 
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values, as shown in the 4th column, are hardly large enough to lead to any serious 
errors in inference. 

With characteristic intuition “Student” anticipated the adequacy of his test 
in sampling from symmetrical leptokurtic systems more than twenty years ago 
in his original paper*; the idea of the “doubling over” in the case of skew 
populations lies also to his credit. 


4. The Two Sample Test. 


We may now examine the adequacy of Fishers two sample 2 -test in controlling 
the source of error (1) when sampling from non-normal populations. If two inde¬ 
pendent samples of n x and n 2 are drawn from the same normal population, then 


2 = 


_ tT l ~ X 2 _ 

\/ T?!^ 2 + n 2 s 2 2 



n x n 2 


(3) 


is distributed according to the law 

111 + 1 * 8-1 

y = constant x (1 + z 2 ) 2 .(4). 

Equation (4) is the distribution of z in the single sample problem with n x + n 2 — 1 
written for n . It does not, however, necessarily follow that when sampling from a 
non-normal population the distribution of z = mjs for n = 14, let us say, will be the 
same as that of the z of (3) when 7*1 = 5, h 2 =10* This fact is illustrated in the 
case of sampling from the leptokurtic population (/3 X = 0, $ 2 = 7*07); there is here a 
considerable positive correlation between the values of m and s in a sample. Large 
deviations in mean tend to be associated with large deviations in standard devia¬ 
tion, and as a result the preceding tables have suggested that the ratio, 2 , is 
slightly less variable than on “ normal theory.” But if we combine the samples of 
5 and 10, taking n x = 5, n 2 = 10, and calculate the 2 of (3), the most variable term 
in the numerator is x lt the mean of the smaller sample, while the most important 
term in the denominator is 10 8 2 2 f which is quite uncorrelated with x x . There is 
not compensation, therefore, as in the previous case, and as a result the 2 is some¬ 
what more variable than that of “ normal theory.” 

Tables IV (a), (6) and (c) show the result of pairing together samples of 
(a) 5 and 10, (6) 5 and 20, and (c) 10 and 20, from the three populations (0, 2*5), 
(0,7 07) and (*50,3‘73). Results for the other two populations sampled are not yet 
available j*. The tables show the observed and theoretical frequencies lying beyond 
certain values, not of 1 2 |, but of a multiple of | z | (as shown at the head of the 
leading column), which was a simpler ratio to obtain in the computation. The 
values of Mean 2 and cr z are, however, given below, as well as the results of testing 
the doubled-over distribution for goodness of fit. The skewness and goodness 
of fit of the undoubled-over 2 -distributions have not yet been examined. The 
final columns of each table contain the frequencies found from a normal curve 
with standard deviation equal to l/Vw'x + n z — 4; even for samples of 5 and 10 
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t See Addendum, p. 285. 
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these frequencies never differ very widely from those of the true “ normal theory ” 
s-curve. The sampling results as they stand do not suggest that the distribution 
of | z | varies in any simple way as the sampled population changes. But this could 
hardly be expected owing to the complex structure of the ratio, in which is 
correlated with ft 2 , and with s 2 2 » but with no cross correlation. It seems justifiable, 


TABLE IV (a). 

Distribution of z in Pairs of Samples of 5 and 10. 
Frequencies in 500 pairs. 


•Jvs |*| 

greater than 


Populations Sampled 


Normal 





distribution 

with 

S.D. = 1/^11 

ft = 0-00 
ft=2-50 


ft=0-00 
ft = 7-07 

ft =0-50 
ft-=3*73 

•00 

500 

500*0 

500 

500 

500-0 

•05 

447 


459 

443 

urn 

•10 

i 402 

386-o 

399 

390 

393*3 


352 

3.H.V0 

341 

337 

342*3 


297 

283*0 

295 

277 

aojf’O 

U 

235 

237-4 

256 

232 

249-2 

•30 

196 

urn-/,. 

222 

197 

208*3 

*35 

157 

wo-h 

184 

156 

171*0 

•40 

131 

130-0 

153 

130 

139-4 

*45 

106 

104-0 

121 

99 

111-5 

•50 

86 

82*5 

102 

85-5 

87-9 

•55 

67 

04-7 

86 

72 

08-2 

•60 

44 

00-4 

67 

53 

52*1 

' *65 

34 

30-0 

61 

44 

89*2 

•70 

23 

30-0 

49 

33 

29*0 

*75 

15 

22*9 

35 

23 

21*1 

•80 

12 ! 

17*4 

23 

18 

15*1 

•85 

9 

13*2 

19 

14 

10*7 

•90 

9 

10*0 

18 

10 

7-4 

*95 

7 

7*3 

10 

9 

5*0 

TOO 

7 

! 

5*7 

7 

6 

3*4 


•161 

_ 

•034 

•591 

— 


16 

— 

16 

16 

— 

Moan z 

- *0116 

0 

S.E. -0135 

4- *0141 

-•0071 

— 

<r* 

•3019 

•3015 
S.E. • 0110 

•3337 

•3024 

— 


however, to conclude, after examining the tables, that the practical worker will be 
led to make no very serious error of judgment if he refers the value of z to 
“Student’s” tables (or even to the normal tables with <r* = 1/V% + n g — 4) when 
examining the difference between the means of pairs of small samples, taken from 
moderately skew, leptokurtic or platykurtic populations. Possibly the position 
might be less satisfactory if n x and n g were below the values of 5 and 10. 


19—2 
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The average of the nine values of P found in the goodness of fit tests is now 
*332. The standard errors given for Mean z and a z are for 500 samples from a 
normal population. 

TABLE IV (b). 

Distribution of z in Pairs of Samples of 5 and 20. 

Frequencies in 500 pairs. 




Populations Sampled 


Normal 

iJr, |i| 
greater than 





distribution 

ft=()-00 

ft = 0*00 



with 

S.D. = l/\/21 


ft 2 = 2*50 

/3. 2 = 8-00 

ft, =5 7 *07 

ft, = 3-78 

mm 


500-0 

500 

500 

noo-o 

*05 

411 

416-0 

412 

417-5 

418*8 

*10 

332 

iWG-0 


340-5 

340-9 

•lf> 

255 

263-2 


260 

269-3 

•20 

198 

199-0 

224 

183*5 

206-2 

•25 

143 

147-3 

162 

139 

152-8 

•30 

106 

105-4 

123 

101 

1U9-4 

•35 

64 

73-4 

82 

72-5 

75-7 

•40 

49 

49-S 

53 

47 

50-6 

•45 

33 

33-0 

36 

30*5 

32-6 

•50 

23 

31-b 

23 

18 

20-2 

*55 

13 

13-6 

16 

8 

12-1 

•60 

7 

3-5 

10 

4 

7*0 

*65 

6 

5*2 

4 

1*5 

3-9 

*70 

2 

3*2 

3 

0*5 

2-1 

•75 

2 

1-9 

3 

— 

1-1 

•80 

2 

1-1 

2 


0-5 

Good nous ( P 

•610 

_ 

*397 


_ 

of Fit 

11 

— 

11 


— 

Mean z 

-•0094 

0 

S.E. • 0098 

+ *0099 

- -0058 

— 

<r$ 

•2175 

■31ft3 
S.E. • 0074 

•2280 

v *2088 

— 


5. Examination of the Second Type of Error. 

Suppose that on finding a value of z such that P g > 2a (say, > *10 perhaps), it 
is decided to accept tho hypothesis that the mean of the sampled population has 
a value 6. How often is this likely to occur when in fact the true population mean 
lies at a instead of 6 ? In such a case “ Student’s ” tables will have been entered 
with £=(£ — b)/s instead of with ze=(x — a)/s, and the error in judgment will 
arise because the test is not sensitive enough to detect this fact What we require 
is to have, for different values of (a - 6), some appreciation of the chance that 
z * < ?< + for the smaller the chance the more effective is the control of 
this source of error. 

* z A being, as above, the value of z giving P,=2a. 
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The position may be explored with the aid of the experimental sampling results. 
We have fixed in the first place on two different values of a, *05 and *01, which 
backward interpolation in “ Student’s ” tables shows to correspond to deviations (*.) 
of 1*066 and 1*873 for n * 5, and of *611 and *941 for n = 10. We have then chosen 
out randomly for each of the five sampled populations, and also for a normal 
population*, 100 of our samples and have given in succession to (a - b) the values 

TABLE IV (c). 

Distribution of z in Pairs of Samples of 10 and 20. 

Frequencies in 500 pairs. 


oJl-5 | z | 


Populations Sampled 


Normal 





distribution 





greater than 

o o 

O 

o c* 

If II 



j9!=:0-50 

(82=8-78 

with 

S.D. = 1/^20 


500 

600-0 

500 


500-0 

■: ' 

411 

415-3 

410 

416 

417-5 


310 

334* 

329 

330 

338-6 


239 

201-1 

258 

273 

266-2 

1 

174 

197-4 

196 


202-5 

•25 

124 

lU'fi 

144 

155 

149-0 

•30 

97 

102-7 

105 

119 

105-8 

•35 

75 

70S 

76 

82 

* 72-5 

*40 

51 

47-4 

55 

59 

47-9 

•45 

32 

30-9 

31 

34 

80-5 

•50 

25 

19-7 

19 

19 

18-7 

•55 

16 

12-2 

12 

8 

11-0 

•00 

12 

7-4 

8 

3 

6-2 

•05 

8 

4-4 

5 

3 

3-4 

•70 

4 

2-6 

4 

— 

1-8 

*75 

1 

1-5 

2 

— 

0-9 

GoodneBS f P 
of Fit \n f 

•104 

_ 

*917 

•092 

_ 

11 

— 

11 

11 

— 

Mean z 

•0000 

0 

S.E. -0088 

-•0036 

-•0085 

— 

<Tm 

*1947 

-am 
S.E. -0006 

•1972 

•1997 

— 


<r/>/n f 2 c/fn f 3 <r/Vn,..., etc. (n = 5* and 10), where is the standard deviation of the 
population sampled, (a — b) has then been added in each case to the observed 
deviation in the sample mean and the result divided by s , the corresponding sample 
standard deviation, to give £ In samples from a normal population, if (a — b) were 
zero, the percentage of values of which should lie in the long run within 

the limits ±z mf should be 90 for a = *05 and 98 for a = *01. For the non-normal 

* One hundred random samples of 5 and 10 were specially drawn from a normal population for this 
purpose. 
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population, the results discussed in Section (3) above suggest that these percentages 
will also be fairly nearly approached. As (a - 6) is increased from zero the number 
of values of f found between these limits will decrease and will represent the 
percentage of false hypotheses accepted. The situation can be explained most 
clearly by turning to Table V. 

This table shows the percentage of samples for which — £«<?<+£« when 
different multiples of 0* cr/Vn have been added to or subtracted from the sample 
mean. Suppose for instance that we have a sample of 10 from the population 
(& = 0, /S 2 = 7*07) and wish to test the hypothesis that the population mean lies at 
6, and decide to accept it if | £ | = j (x -b)\/s < z a - Then the experimental results 
suggest that 

(1) if the true population mean were to lie at a = b ± 2cr/V 10 = 6 + *63 <j instead 
of at 6, we should in repeated sampling accept 38 °/ o of these false hypotheses if we 
took a — ‘05 as the critical level, and 67 / if we took a = *01 ; 

(2) if the true population mean were to lie at a = 6 ± 4<t/n/ 10 = 6 ± l*26cr we 
should accept in repeated sampling only 2 °/ Q of these false hypotheses in taking 
a= 05, and 9 C / Q with a = *01. 



The table therefore shows the sensitiveness of " Student’s ” test in rejecting false 
hypotheses when applied to samples from various populations. We may comment 
on the results briefly as follows: 

(а) For symmetrical populations the test will be equally sensitive whether (a - 6) 
be positive or negative. Multiples of ajsfn were therefore only added to the observed 
samples *. 

(б) There is extremely little difference in the degree of sensitiveness among the 
samples from the four symmetrical populations, the percentages being of course 
subject to sampling errors. 

(c) The Type III populations were both positively skew, and the position is 
represented diagrammatically in Figure 3. An examination of Table V shows that 

* For convenience in comparison with the results for the skew populations the figures for the 
normal population have been repeated on the negative side. 





TABLE V. 

z-Test. Table showing Percentage of False Hypotheses accepted when the True Population Mean lies at 
Increasing Distances (a — b) from its Supposed Position. (0 - aj\ f n.) 

Values of (a — b). 
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Sample 
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for these two populations the test is quicker in rejecting false hypotheses when the 
true population mean is, as shown in the figure, to the right of its supposed position 
than when it is in the other direction. 

(d) For values of ( ct — b) with the same sign as the population skewness the test 
appears to be slightly more sensitive in rejection than in the normal case. But for 
the opposite sign the position is distinctly less favourable when the populations are 
skew. In other words when dealing with skew populations there is more danger of 
failing to detect a faulty hypothesis when the long tail of the true population 
distribution points towards the position of the supposed mean than when the steep 
tail does. In certain problems the direction of the skewness, if not its exact magnitude, 
may be clear; in such cases we shall know that the chance of error is less completely 
controlled in one direction than in the other. 

(i a ) The control of what has been termed the first source of error is as good for 
small samples as for large, provided that the population is such that | z | follows 
approximately “ Student’s ” law. It is in dealing with the second source of error 
that small samples are at a disadvantage. Suppose for example we are dealing with 
normal populations and on obtaining a sample (x f s) decide to accept the hypothesis 
that the population mean lies at b whenever f = (x — b)js < z a or P z > 2a = ‘10, say. 
Then a rough interpolation in Table V suggests that for samples of 5 we may be 
accepting the hypothesis in as many as about 42 °/ Q of cases where the true popula¬ 
tion mean differs from b by as much as the population standard deviation; while 
in samples of 10 this will happen only in about 9 % of such cases*. For samples of 
20 the risk would be almost negligible. There is nothing new in this except perhaps 
the method of approach; it is the old tale that no conceivable method of statistical 
analysis will enable differences below a certain limit to be detected from the evidence 
of a single small sample. 


6 . An Alternative Test. 

In a recent paperf it was shown that in sampling from a rectangular population 
the appropriate criterion to use in testing a hypothesis regarding the position of 
the mean, a, was not z but the ratio z' = ((? — a)/^R t where 

u and v are the highest and lowest values of the variable in the sample, 

(r is the sample 14 centre,” = £ (u + v), 

R is the sample range, = w — y. 

The theoretical distribution of / in samples of n from this population was 
obtained, and it was suggested that perhaps it might be of wider application, just 
as “ Students ” ^-distribution has been found to be adequate for populations differing 
considerably from the normal. Further analysis, however, soon showed that the 

* For n=5, o = *05 we Lave interpolated roughly between the oolumna a -ft=20 = 2<r/\/5 = ‘894<r and 
a-b=8d==3<r/\/5==l*342<r, i.e. between the percentages 48 and 24. For n=10we interpolate between 
a~b — 8 <rjsl 10 and a - b = 4<r/Vi0. 

t Biometnka, Vol. xx A . p. 212. 
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“ rectangular theory ” /-distribution would not be appropriate for samples from the 
populations of common statistical experience. That this is so is suggested at once 
by an examination of the values of oy found from the sampling experiments and 
given in Table VI. The whole form of the curve also changes. 

TABLE VI/ 

Comparison of Observed Distribution of z' with Empirical “Normal Theory ” 


Populations. 





ft = 0‘00 

o-oo 

0* •• 00 

0*00 

0-20 

n 



j9 a *= 1*80 

2*50 

8‘00 

4*12 

3-30 


Goodness I 

P 


•472 


•558 

•175 


of Fit 1 

n' 

— 

12 

— 

12 

15 

K 

Mean z 


0 

- *0103 

0 

- *0224 

+ •0274 


S.E* 



•0242 

— 

*0242 

*0171 


or,' 


•C>77?i\ 

•5515 

•5418 

•5150 

•5651 


8 .E.* 


— 

*0309 

— 

*0309 

•0218 


Goodness J 

r p 

_ 

•003 

_ 

•063 

•007 


of Fit 1 


— 

13 

— 

13 

13 

10 

Mean / 


0 

+ •0160 

0 

+ *0034 

+ •0737 

S.E* 


— 

•0132 

— 

*0132 

•0132 


or,' 

S.E* 

•1890\ 

*2629 

•2947 

*3169 

*3246 


— 

•0103 

— 

•0103 

•0103 


It seemed, however, worth undertaking the following research: 

(а) Find experimentally the distribution of / in samples of 5 and of 10 from 
a normal population, and by fitting the data with curves obtain empirically “ normal 
theory ” /-curves. 

(б) Test the adequacy of these curves to represent the distribution of / in the 
samples from the three neighbouring non-normal populations, with 0 X and 

(0, 2*5), (0, 41), (0*2, 3*3). That is to say examine the adequacy of these distributions 
in controlling the error (1). 

( c ) As in the case of z , examine the sensitiveness of the /-test in rejecting false 
hypotheses (control of error (2)). 

(d) Make a comparison of the sensitiveness of the z - and /-tests for samples of 
5 and 10 from the same populations. 

* Standard errors for samples of 500 or 1000 if the distribution law of z ' were of the empirical 

•• normal theory” form. 

t These are theoretioal values obtained from equation (xliii), Biometrika , Vol. xx A . p. 211. 
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Let us take these steps in order: 

(а) Mr L. H. C. Tippett very kindly placed at our disposal the 1000 samples of 
5 and of 10 from a normal population which he had used in his work oil the 
Distribution of Range*. He also undertook some preliminary computation. The 
distribution of z' must clearly be symmetrical; the following values were obtained 
by using the 2nd and 4th moment coefficients about z = 0. 

n — 5 oy = *5418 /3 2 = 7*5225, 

n = 10 oy = *2947 fi 2 = 3*4342. 

Type VII curves were fitted to the observations and gave on applying tests for 
goodness of fit, for n = 5, i J = *7l5; and for n== 10, P = *491. These curves were 
taken to represent the standard /-curves of “normal theory/* and the chance of 
exceeding any given value of / could be obtained by interpolating in “Students” 
Tables of t (Metron, Vol. v. No. 3, p. 26). 

(б) The two curves were then doubled over and fitted to the observed dis¬ 
tributions of | / | for the three non-normal populations with the result shown in 
Table VI. The fits appear quite reasonable for samples of 5, but arc no longer 
satisfactory when n = 10. That is to say it would appear that the “ normal theory ” 
/-curves will only represent the distribution of | / | from moderately non-normal 
populations in very small samples. It did not seem worth while attempting the 
fitting in the more extreme cases of sampling from the populations with /£, and 
ft 2 (0*00, 7*07) and (0*50, 3*73). The table shows how, for symmetrical populations, 
oy decreases with yS 2 for n = 5 and increases for n = 10. For samples from the skew 
Type III population the distributions of / are negatively skew, and Mean /, at any 
rate for n = 10, differs quite significantly from zero. In dealing of course with a skew 
population the mean value in repeated samples of O , the “centre/’ is no longer at 
the population mean but at a point which changes as n is increased. 

(c) The sensitiveness of the test in the control of error (2) was examined in 
precisely the same manner as for the £-test. The error arises because on taking 
P? * 2a as the limiting probability f, we find - z* < < + z m \ where = (O — 6)/|i2 

has been calculated instead of /==((? — a)/^R f the supposed population mean being 
at 6, the true one at a. Table VII gives the observed results based as before on 
100 samples in each case. The following appear its most important features:. 

(1) For the symmetrical populations the test becomes less sensitive the more 
leptokurtic the population. This is not connected with the change in oy, which, as 
we have seen, takes place in opposite directions for n = 5 and 10, but arises because 
the z criterion becomes less and less efficient in controlling error (2) as we move 
away from the rectangular population for which it is theoretically most suitable. 

* Biometriha , Vol. xvn. pp. 864—87. 

+ iV = 2 j /(«') dz\ Using the empirical distribution referred to above it was found that 

for a= *05, 3 a '= *852 when n = 5 and z tt '= *482 when n=10, 
for a= *01, z tt ' = 1-404 when n=5 and *710 when n=10. 
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(2) In sampling from the skew population, the control is slightly better than in 
the normal case when {a-b) is positive, but worse when this difference has 
a negative sign. Exactly the same effect was observed in the case of z . 

(i d ) We shall conclude with a comparison between z and z\ For the second 
type of error, this can be obtained by comparing Tables V and VII, but for con¬ 
venience the results for the case a = *05 have been placed together in Table VIII. 
The figures give the percentages of false hypotheses accepted for increasing values 
of (a-b). It will be seen that for samples of 5 the 2 -test is not very much more 
sensitive than the 2 -test, but that when n has increased to 10 the former has 
a very marked advantage. The difference is least for the platykurtic population. 

TABLE VIII. 


Comparing the Efficiency of the z - and z'-Tests in Rejecting False Hypotheses . (0 = cr/V n.) 



n-5 

w = 10 

Pl 

0*0 

0-0 

00 


0-2 


00 

0-0 

00 


0*2 


Pi 

2-6 

3*0 

4-1 


3-8 


• 2 

5 

3* 

0 

4-1 


3*3 









Positive 6 

Negative 0 







Positive $ 

Negative 0 

a-b 

- 

z* 

z 


z 

z' 

z 

z’ 

z 

- 

z 


z 

z’ 

z 

z ' 

z 

z' 

z 


0 

7r> 

77 

70 

77 

78 

82 

Si 

72 

07 

82 

74 

80 

00 

81 

~4 

75 

72 

67 

84 

80 

20 

r>7 

60 

# 

62 

P 

54 

r>4 

51 

38 

63 

30 

56 

44 

59 

41 

57 

31 

46 

40 

66 

30 

38 

28 

n 

29 


35 

u 

21 

10 

28 

18 

29 

w 

29 

n 

33 

0 

25 

18 

44 

40 

7 

11 

7 

14 

a 

15 

3 

3 

7 

13 

8 

9 

1 

13 

4 

17 

1 

6 

3 

20 

60 

60 

0 

1 

1 

7 

j 

t—* 

— 

— 

4 

1 

5 

l 


1 

— 

7 


6 

3 

— 

1 

— 

9 

4 

70 

80 

1 — 

. 

- 

— 

— 

- 

- 

— 

— 

1 

- 

- 

= 

- 



1 

= 

— 

— 

1 

1 


The figures for z have been printed in italics to aid the eye in comparison. 


We have available, therefore, the distribution of two criteria in samples from 
a normal population, one (of z) known exactly and the other (of z') found empirically. 
That of the former, which is in theory ideal at the normal point, has been shown to 
be still applicable for a very considerable variety of population forms. But the 
distribution of z\ while providing complete control of the first type of error at this 
point, begins to lose this control much more quickly than does that of 2 as the 
population form is modified. And further the z criterion is less sensitive than the 
other in the detection of false hypotheses regarding the position of the population 
mean. Were the populations of experience clustered round the rectangular point 
the situation would almost certainly be reversed. 

Owing to the simplicity in calculation it seems, however, possible that in 
problems where the population is known to be approximately normal, the criterion 

z = (u + v - 2 a)!(u - v) 

may be of value in providing a rapid method of testing the validity of a hypothesis 
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regarding a population mean from a knowledge of the two extreme individuals in the 
sample only. This would be in cases where n is not greater than, say, 7 or even 
10; for n = 2 of course z = z\ It is therefore hoped to provide shortly brief tables 
of the empirical “ normal theory ” probability integral of /. 

Elsewhere in statistical theory there may well be cases of two criteria, for both 
of which the frequency distributions in sampling from a given population are known. 
In the case of z and z the method of likelihood expresses in simple logical form 
the reason for the choice of z at the normal point and of / at the rectangular point. 
This method should be applicable in other cases, and its value in picking out the 
right criterion is supported by the conclusions of this paper which have been reached 
by a quite different process of argument. In any case, however, the sensitiveness 
of the tests to changes in population form could not have been gauged except by 
the present form of experiment or by surmounting certain stubborn obstacles in the 
mathematical theory of sampling. 

In conclusion, it is necessary to emphasise the extent to which this paper is 
a result of co-operation. The labour of sampling and computation is far too great 
to have been undertaken by a single individual. Mr N. K. Adyanthaya has been 
entirely responsible for this and other work on the symmetrical population with 
j8 2 = 4T. As has been stated above, the results for samples of 5 and 20 from the 
skewest of the populations have been taken from “ Sophister's ” paper, and acknow¬ 
ledgments have also been made to Dr A. E. R. Church and Mr L. H. C. Tippett. 
Far the greater part of the remaining computing has been courageously undertaken 
by Mrs L. J. Comrie, while other computers have been Miss Marie H. Anderson, 
Mr A. B. Thomson and Mr Ernest Martin. To Mr A. E. Stone we are indebted for 
some 11,000 samples, and the diagrams are the work of Miss Ida McLearn. To all 
these the chief author is exceedingly grateful. 


Addendum : Distribution of z in 500 Pairs of Samples from 
Population fa = 0 20, fa = 3*30. 


Samples of 5 and 10 

Samples of 5 and 20 

Samples of 10 and 20 

vTB |.| 

greater than 

Observation 

Normal 

Theory 

greater than 

Observation 

Normal 

Theory 

| z | 

greater than 

Observation 

Normal 

Theory 

•00 

500 

500*0 



500*0 

•00 


600*0 

•15 

361 

333 0 




•10 

342 

334*5 

*30 

204 

196-4 

*20 

200 

199*9 

•20 

203 

197*4 

•40 

135 

130*0 

*30 

121 

105*4 

•30 

119 

102*7 

•50 

88 

82*5 

*40 

59 

49'8 

•40 

64 

47-4 

•60 

52 

50*4 

*50 

23 

21 ’4 

•50 

25 

19*7 

•70 

29 

30*0 

•60 

9 

8-5 

•55 

15 

12*2 

•80 

14 

17*4 

•65 

6 

5*2 

•60 

8 

7‘4 

•90 

6 

10*0 

•70 

3 


■65 

6 

4*4 

1-00 

5 

5*7 

•75 

2 

1*9 


5 

2*6 

p 

•337 


P 

•441 


P 

*185 


n' 

16 


n' 

12 


n' 

11 



•3023 

•3015 

<r. 

•2204 



•1983 

•1961 




















286 Small Samples from non-normal Populations 

The above results correspond, in somewhat abbreviated form, to those of 
Tables IV (a), (b) and (c) above. The values of cr z show very close agreement with 
“ normal theory,” and the frequencies do not appear to differ seriously. 

For the population fi i = 0:00, #2 = 4*12 the result for samples of 5 and 10 alone 
is available. Testing goodness of fit to “ normal theory ” it is found that P = *350, 
while <r z = *3111 against the normal value of *3015. The distribution of z is 
somewhat too variable, but not as much so as in the case of samples from the 
extremely leptokurtic population ($ 2 = 7*07). 



SAMPLING WHEN THE PARENT POPULATION 
IS OF PEARSON’S TYPE III. 

Br CECIL CALVERT CRAIG, Ph.D. 


Introduction. It is an immediate extension of my thesis* to apply the methods 
there developed in a more detailed way in the study of sampling in cases in which 
the parent distribution is skew. One of the most useful and important of the skew 
frequency functions is the Type III of Pearson. If its equation is written in the form 


/«■>r©<“ + '>^'‘ 


.( 1 ), 


in which 


x — in 


X 


__ 

a -—, and 6 = a 2 (a 3:x = Vft), 


«s 


.( 2 ). 


the semi-invariants, \ r , of this frequency function follow the simple lawf 

K b r ~ l ~ a r ~* ’ . 

(\o = l and \j = 0 since (1) is written in standard units with the origin at the 
mean.) This simple, explicit expression for any semi-invariant invites an applica¬ 
tion of the method of semi-invariants to the case in which the parent distribution 
is of this type. 

The three parameters of the Type III distribution are the mean, m x , the 
standard deviation. a x , and the skewness, a 3:a: . (The measure of skewness is in 
this case just twice that given by Pearson.) The problem is to find the semi¬ 
invariants of the frequency distributions of these three characteristics as found 
from samples of N, each taken from an infinite parent population which is distri¬ 
buted according to the Type III law. 

More explicitly let the infinite parent be given by means of the semi-invariants 
of (2). Let infinitely many random samples of N each be taken and the mean, the 
standard deviation, and the skewness be calculated for each sample. To find the 
semi-invariants, X-. Cx = d r , and = b r , of the frequency distributions of 
the sample means, sample standard deviations, and sample skewnesses thus obtained. 

Section I. The Frequency Distribution of Sample Means. In the case of sample 
means it has already been shown that they also form a Type III distribution if 
the parent doesf. I believe however that it will be interesting to show how neatly 
the same conclusion is reached using semi-invariants. 


* Metron, Vol. vn. pp. 8—75. 

f Steffensen, J. F., MatematUk lagttagtitulare, G. E. C. Gods, Copenhagen, 1928, p. 60. Also 
the development of Section I follows exactly the same lines. 

+ Church, A. E. B., “Means and Squared Standard Deviations of Small Samples from any Popula¬ 
tion.” Biometrika, Vol. xvm. (192C),pp. 385—888. Also, Irwin, J. 0., “On the Frequency Distribution 
of the Means of Samples, etc.” Biometrika, Vol. xix. (1927), pp. 228, 229. 
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If from an infinite population distributed according to a frequency law which 
has \i, Xs,... X r ,... for its semi-invariants, infinitely many random samples of N 
be taken, it is a property of semi-invariants that the distribution of means, m 9) of 
these samples, has semi-invariants, \ r .m x , given by* 

X r . m x = . 

If the parent distribution be of Type III then 

* _ (r-l)la r 

(Nby* 1 . 

Comparison with (1) and (2) suggests that in this case 

(N1A Nb p~ Nb Nh „ 

F (y)=™nxm) {a + y )Nb ' le ^ .< 5 >> 

in which y = It is only necessary to verify this by finding the semi-invariants 
of (5). This is done by equating the coefficients of like powers of t in 

.( 6 ). 

The minimum value of z in (1) is —a and this will also be the minimum value 
for y. Put a + y = m in the right member of (6) and it becomes 

{M) Nb e~ m . „ . (Nb) Nb e~ at f® N , . -J Nb -t), 

~nhi'/\Tu\ m Nb ~ l e a e~ a ~ , d(t> = ... . r , m Nb ~ l e \» /dm 

a m 1 (Nb) ) o a Nb r(Nb)J 0 

^myo 

(?-r (>-sr 

Then taking the logarithms of both sides, 

0it + *,0*t* +±0 3 t 3 +... = -at-Nblog(l - ^) 

_ (at) 2 , (at)* ^ m * 

2A7> 3(Nbf . K }t 

„ % (r-1)1 or 

0r X r : mj[ — 

as anticipated. Note that the coefficient of £° on both sides of (7) is zero; that is, 
the constant term in (5) has already been so chosen that the total frequency is unity. 

The final step is to write (5) in its own standard units. The mean is already at 
the origin since Xi :ma; = 0. Also 

j 

L (8). 

and a 3 . = ft*:-"* = -ffi. _ 2 _ g »‘» 

* cr*m x XW a\/N SN 

* Thiele, T. N., The Theory of Observations , C. and E. Layton, London, 1908, p. 42* 
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If now I write 


■y^N, 


and 

(5) reduces to 


= a W, 

a»:m x 

B = A* = Nb, 

N^B B e~ B B . 

*(*)- a»%~AA + z)*-'e-A> 


a b r (B) 


or 


F(y)<r\ 


- -F( z ) = -Jb?(B)( A + z ) B ~ le ~** 


( 9 ). 


which is the desired form. 

Knowing that the distribution of sample means is of Type III, in practice 
formula (4) is the one that will be used. For example, if for the parent a 3 = 0*5, 
then a = 4 and b = 16. If I choose N = 100, 

Xi: m z ** =* 4, 


x 8: 


% ■ 


J m x ' 




m x ' 


Nb 

01 , 

2a® 

: (Nby 

. X3 : tn x . 


= vr=^ a 


N 


rn x y 


= 0-00005, 


^ 3 — = 0 * 0 5. 


m x 


These are the characteristics of the Type III distribution of means of samples of 
100 taken from an infinite parent population which is distributed according to the 
Type III law. 

In general terms, this Type III distribution of sample means has the same 
mean as the parent distribution, and a standard deviation and a skewness, a 3 , equal 
to those of the parent respectively divided by sfN , where N is the size of each 
sample. 

Section II. The Semi-invariants of the Distributions of Sample Standard 
Deviations and a 3 ’s. 

In my thesis I used the following method for approximating the values of the 
semi-invariants of the distribution of standard deviations in samples of N taken at 
random from an infinite parent population, the frequency law for which is given 
by the semi-invariants, \i,\g,... X,,.... The desired semi-invariants, ... d r ,..., 

are defined by 

f*+h**+h*++". am [ m d<r ie f 1 (<T x )*'**- r .(10), 

J -00 J -00 

in which fi(<r x ) is the frequency function for a* due to sampling and /*(v») is the 

Biometrika xxi 20 
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same for v t . (j/ 2 = oy\) The second form on the right arises by virtue of a well- 
known property of semi-invariants. I write 

Vi — \i + e, 




and then m, — — 

so that \/xi ^1 + ^. 

On substitution in (10), I had 

S't+iiW+h' 1 **- •« j x dvtMvd/^i'+O** .(ii). 

J - 00 

Expansion of the exponential on the right gave 

d>-x,(i + Q-d,> }... (12) , 

cZa“X 2 ® + + + + ^ — 3cZic? 2 ^i 3 

(£4 = X 2 2 (1 + Zb + j> 2 ) — 4rfirf 3 — 3d 2 2 - 6(Z 2 di 2 - di 4 
in which v r is the rth moment about the mean of the samples’ i/ 2 ’s. For the v r s were 
substituted their values in terms of the semi-invariants S r (v 2 ) of v 2 due to sampling. 
In the results g r was written for $ r (i> 2 )/\ 2 r • Also $i(i' 2 ) is of order — 1 in N and 
S r (v 2 ) is of order — (r—1) in N for r> 1. Below are given the results when all 
terms of order —3 and higher in N are retained : 

di = 5ts* (1 + h9x -l 92 -l9i* + re9* + fgffm + i l a9i 3 ~ 1 1*9*~ tiV'Mi ' 

“ Ji»9* ~ ¥19291 *+t$s9z92 + }M92*9i - ^9%) 

di = ^ (92- h9i-9291 + h9i+9*9i + 192* +9291 ~ S19*92 - 1 \9**9i+ Hfi '* 3 ) 

<h = ^ (93 - i 9* - l9*9i ~ W + W-9392 + i l l 929i ~ f I92) 

d * = 16 “ 65,35,2 + 

For further details of these calculations and of similar ones in the case of or 3 
consult my thesis. 

For or 3 essentially the same device was used. The semi-invariants, 6 lf b% t b Zi 
are defined by 

aW+.lV+yjV+ = r da a <f>(a s )^, 

J - 00 

in which <£(a 3 ) is the frequency function of the a 3 fl due to sampling. Using this, 
I wrote 


V...03). 


as before, and 


v 2 = \ 2 + €1, 
^ = + e 2 , 
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in which X a is the third semi-invariant (or third moment about the mean) of the 
parent. Then I rewrote the above, 


jht+hb,e‘^,b,t*+ 


r<*> /•« ( A„-W a ) / + e,\ 

3=8 I I dv% dr$0 (v 2 > vs) 6 ^ ^ .(14), 

J - CD J - CD 


as in ( 10 ), in which 0(p 2) i/ 3 ) i« the correlation function of v 2 and i / 3 due to sampling. 
The exponential term on the right was expanded, and I had 

V 30 


61 « 


■ do - 1 


*'« + jyi _ ! 
A a ^ V 






.(15), 


:o-*S + *S-“ 5 + -) 

!» - (*“'* 8 If + § - W jjj + ...)- SW, - W 

' , ‘-x?i‘ w "(~- fi £ +2i &- 5o S + ■) 

— 4&i6 3 — 36 2 2 - 6 M 1 2 — & 1 4 
in which the i/ r *s are moments of the correlation function 0(v 2i v 2 ). (j/oo = 1 .) 

For the v rg *H their values in terms of Sij(p 2 , i/ 3 )’s the semi-invariants of 0{r 2i V 3 ) 

were substituted in the above, and in the results gti written for • In my 

Xj 2 

thesis I have the final expressions which include all terms of order — 3 and higher 
in N. Here, for reasons to be discussed later in this paper, I will only reproduce all 
terms of order — 2 and higher in N. These are : 

bi = a 3 (1 — |< 7 io 4 - V^ao 4 - \f-/ 7 io 2 — f$< 7 ao — ^nrffwffvo + < 7 ao 2 ) 

+ (#01 — f #11 — $ #10 #01 + *£$21 + tygwgoi 4 - J r#ntfio ~ -Wifao ffn) 
l>2 = «3 2 (f#» - -V'^30 - ^rgxgio + W^720 a ) 

4- 2a 3 (— |^ u 4- ^-^21 + 4 # 2 o#oi 4- 6 « 7 n#io — 1 &- <7ao<7u) .. -(16) J 

4- (<7oa — 3^12 — 3^o2 t 7io — 3<7n<7oi 4- ^gn 2 4* 6 ^ 20 ^ 02 ) 
h = «a 3 (— ^< 7 ao 4- ^^^ 20 2 ) 4* 3a 3 2 (J £21 — 13#20<7n) 

4 - 3 a 3 (— f <712 4 ^ #20^02 4- a /-<7n 8 ) 4 - (//os — ^02^11) 

64 is of order — 3 in N, 

The series in ( 12 ) and (15) give rise to questions of convergence. In my thesis 
I imposed a sufficient condition on the sampling which has the effect in practice of 
ensuring convergence. This restriction was that the frequency distribution of sample 
1 / 2*8 be of limited variation. This got rid of the difficulties on the point of con¬ 
vergence in ( 12 ) and (15). Then, strictly speaking, the i/’s in (12) and (15) and 

2Q-3 
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the #’s which are substituted for them in obtaining (13) and (16) should be the 
moments and semi-invariants of the frequency distribution of sample i^s and of 
the correlation surface of sample v 2 s and respectively in which in both cases 
the range of sample v 2 b is restricted to an arbitrary interval about their mean. 
But the moments and the semi-invariants actually used are those on which no such 
limitation has been made. It was assumed for moments and semi-invariants of low 
order that the limited range is still large enough in practice for the error made to 
be negligible. But since the order of contact of the tail of a Type III distribution 
is not so high as it is in the case of the tails of a normal distribution, it appeared 
that in the case of a Type III parent this assumption had better be more carefully 
studied. My digression for this purpose grew to such proportions that it seemed 
better to give it a separate existence as a study of the semi-invariants and moments 
of incomplete frequency distributions both normal and of Type III. In that in¬ 
vestigation, which I hope shortly to publish, it is found that the assumption is valid 
for semi-invariants of v t of order as high as four for skewness of the parent not 
exceeding unity. Also this other paper makes it possible to answer another question 
which arises in the present connection, namely, concerning the systematic error 
which is introduced when in the sampling formulae arrived at in this present 
investigation, values of semi-invariants and moments determined from a finite and 
necessarily incomplete sample distribution are substituted in place of those of the 
parent. It must be well known that in the case of the Type III parent, at least, 
the differences so caused are by no means negligible; it appears from my work 
that these discrepancies alone render practically valueless the inclusion of semi¬ 
invariants or moments of any high order in such formulae as I give below. 

The calculation of the d s to order — 3 in N and of the b 's to order — 2 in N is 
carried out without the use of semi-invariants of order higher than four for either 
v 2 or v 2 and v 2 together and with the use of only the first eight semi-invariants of 
the parent distribution. Into the formulae (13) and (16) the values of the g *s as 
found from my thesis and expressed in terms of cr 3 by means of (2) were inserted 
and after reduction I found : 

To order — 3 in N, 


di -1 - ^(rW + l) + ji (ttt* 4 + iW-h) 

~ 3 + VrV a »* + tSs) 

rf * = 47y-d «» 2 + 2 ) " (U «3 4 + V «3 2 +1) 

+4^3 (w «3 e +*&■ a 3 4 +w - i) 

ds = (¥*3* +13*3* + 2)- ~ ( W « 3 « + a| 8 a a8 4 + Ai i a3 ._ f) 

2 

= ^3 ( V«3 4 + 4 7 «3*+28) 


V 


> ...(17). 
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To order — 2 in N, 

b, - «3 [i - ± + V) + Y* + ^ la * a+ W)] 

b t - i [^«, 4 + 9«3* + 6 )- i (W«3 6 + ^r-«3 4 + + 36)j ...(18). 

63 = + ^« 8 4 + ^«* s + 216) 

I was considerably perturbed over the apparent danger there was of 6 * becoming 
negative or quite small for small values of N, It is possible to calculate the 6 ’s to 
order —3 in N using the semi-invariants of v% and vz to not more than the fourth 
order and using only the first twelve semi-invariants of the parent distribution. 
I spent literally months performing the necessary preliminary computations and 
I did complete b\ and 62 to order — 3 in N before deciding it after all to be labour 
in vain. In the first place the results involve 03 to powers as high as the tenth and 
the standard error of <*3 is of the order of Vfi/AT« For small values of N it is quite 
useless to include high powers of « 3 in the values of the Vs. And for small values 
of N also the value of a 8 obtained from the sample is shown by my other investiga¬ 
tion to be nearly valueless for use in values of Vs which include high powers of #3 
as determined from the parent. 

Ann Arbor, 


Michigan, U.S.A. 



NOTE ON DR CRAIG’S PAPER. 

By EGON S. PEARSON, D.Sc. 

The formulae (17) ami (18) of the preceding paper contain approximate expres¬ 
sions for the semi-invariants and moment coefficients of the standard deviation and 
of \/fi l in samples from a Pearson Type III curve. The results are of considerable 
interest, but as it is important from the practical point of view to appreciate how 
far the expressions are convergent and how far they are modified by changes in 
sample size and population form, it seemed desirable to examine them numerically. 
With this suggestion Dr (Yaig has readily concurred, but he is of course in no way 
responsible for the conclusions which I have drawn. I have taken sizes of sample 
with A = 5, 10, 20, 50, 100, 250, 500 and 1000 (the last two only in considering 
the semi-invariants of \/Si)t and examined the position for samples from five 
populations with increasing skewness, namely /S a = 0, 0'2, 0-5, 1*0 and 15 corre¬ 
sponding to a 3 =r^jS=0, '447 2, ’7071, TO000 and T2247 in Dr Craig’s notation. 
Table 1 contains the numerical values of the successive terms of the expansion (17), 
and Table II of (18). It will be remembered that the population standard deviation 
is taken as unity. 

To summarise and compare these results I have formed Tables III, IV, V and 
VI. It is not of course possible to fix any exact point at which the expansions (17) 
and (18) become inadequate; this will depend partly on the purpose to which the 
results are turned. But I have assigned a rough scale of (? ?) and (?) to borderline 
cases. Any reader who is not satisfied with this classification win form his own 
directly from the Tables I and II. The expressions for d 4 (4th semi-invariant of 
the standard deviation) and for b 3 (3rd semi-invariant of V/?i) contain only one 
term, and therefore the marks noted against the tabled values of the B 2 of s and 
the B 1 of VA are very arbitrary. It will be clearest to discuss the latter tables in 
detail separately. 

The Mean of the Standard Deviation (dj = Mean s). Table III. 

Table I shows that the convergence is good except for very small samples from 
the skewer population. For the normal population the values obtained from (17) 
agree exactly to four decimal places with those obtained from “ Student’s ” curve 
and given in Biometrika, Vol. x. p. 529. The results show how with increasing N, 
the Mean s converges somewhat more slowly on the population a as the variation 
deviates from normality, but the difference is not great. 

The Standard Error of s (fd 3 = a t ). Table IV. 

I have compared Craig’s value from (17) with two other approximations, namely 
Approximation A, <r ( = ^ | 1 J & - 3 + 

_z / &- 1 

2 V N * 


Approximation B, a, 
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TABLE I. 

Distribution Constants of the Standard Deviation, s, in Samples of N. 


N 

d\ = Mean « 

rf a = <r, 9 

<h~fh( s ) 


(1) Nonnal Population with ft = a 3 2 — 0 

5 

1- -150 (XX)--008 760- 

*000662 

■100000--005 000-•001 500 

•010000+-001 500 

10 

1 - 76000- 2187- 

70 

50000- 1250- 187 

2600+ 187 

20 

1- 37500 - 547- 

9 

25 000- 312- 23 

625+ 23 

50 

1- 15000- 87- 

1 

10000 — 50- 1 

100 + 1 

100 

1- 7500 - 22 


5000- 12 

25 

250 

1- 3 (XX)- 3 


2000 - 2 

4 


(2) Population with ft«a 3 2 —0*2 

5 

1 — *157 5CX) - *002 207 — 

*005 825 

*115 000 - *020 212 + ’010 927 

•023 825-*015 778 

10 

1- 78 750 - 574- 

728 

57 500 - 5053+ 1366 

5 956- 1972 

20 

1- 39375- 144- 

91 

28 750- 1 263+ 171 

1489- 247 

50 

1 - 15750- 23 — 

6 

11500- 202+ 11 

238- 16 

100 

1 - 7875- 6- 

1 

5 750- 51+ 1 

60- 2 

250 

1- 3150- 1 


2300- 8 

9(5)- 0(1) 


(3) Population with j3 lS =« 3 2 = 0-5 

5 

1~ *168 750+*008 613- 

•020 260 

•137 500 - -045 703 + -043 427 

*047 656-*059458 

10 

1- 84 375+ 2153- 

2 532 

68 750- 11426+ 5 428 

11914- 7 432 

20 

1 - 42187+ 538- 

317 

34375- 2 856+ 679 

2 979- 929 

50 

1- 16875+ 86- 

20 

13 750- 457+ 43 

477 - 59 

100 

1- 8437+ 22- 

3 

6 875- 114+ 5 

119- 7 

250 

1- 3375+ 3 


2 750- 18 

19(1)- 0(5) 


(4) Population with ft = a 3 2 — 1*0 

5 

1- -187500+ '030078- 

*065 083 

•175 000-*096 312 +-141 445 

*095 625- -188 320 

10 

1- 93 750+ 7 520- 

8135 

87 500- 23828+ 17681 

23906- 23 540 

20 

1- 46875+ 1 880- 

1017 

43 750- 5957+ 2 210 

5977- 2943 

50 

1- 18 750+ 301- 

65 

17 500- 953+ 141 

956- 188 

100 

1- 9375+ 75- 

8 

8 750- 238+ 18 

239- 24 

250 

1 - 3 760+ 12- 

1 

3 500- 38+ l 

38(2)- 1(5) 


(5) Population with ft —a 3 2 * 1*5 

5 

1-206 250+*055 645-* 

141261 

*212 500 - *153 828 + *305 476 

•153 906-*401 382 

10 

1 - *103125+ 13911- 

17 658 

*106260- 38 457+ 38184 

38 477- 60173 

20 

1- 51562+ 3 478- 

2 207 

53125- 9 614+ 4773 

9619- 6272 

50 

l- 20625+ 556- 

141 

21250- 1 538+ 305 

1539- 401 

100 

1 - 10312+ 139- 

18 

10625- 385+ 38 

385- 50 

250 

1 - 4125+ 22- 

1 

4250- 62+ 2 

61(C)- 3(2) 
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TABLE III. 


Mean of Standard Deviation (di = Mean «). 
Population 


Size of 
Sample 

ft=0 

0‘2 

A=0-5 

© 

li 

ii 

Craig (17) 

Biometrika 
Vol. x. p. 529 

m 

Craig 

Craig 

Craig 

5 

•840688 

•8407 

•6344 

•8196 (?) 

•7776 (??) 


10 

•922 743 

•9227 

•9199 

•9152 (?) 

•9056 (?) 


20 

•961 944 

•9619 

•9604 

•9580 

•9540 

•9497 (?) 

D 

*984 912 


•9842 

•9832 

•981ft 

•9798 

■ 

*992 478 

•9925 

•9921 

•9916 



250 

•996997 

— 

•9968 

*9966 

•9963 

■9959 


It will be seen that for the normal population, the values from (17) almost 
agree throughout to 4 decimal places with those given in Biometrika , Vol. X. For 
very small samples the expansion to 3 terms in (17) becomes inadequate as the 
population becomes skew, but there is no reason to suppose that the Approxi¬ 
mation A is any more satisfactory in these cases. It will be noted however that 
as soon as the run of the terms in Table I suggests that the values from (17) are 
satisfactory, these values agree closely enough for most practical purposes with 
those of Approximation A. 

This is a result of some interest and shows the value of the latter expression 
as an approximation to the standard error of the standard deviation in non-normal 
material. For samples of more than 50 the very simple expression, Approxi¬ 
mation B, provides good values. 

In Biometrika , Vol. xn. pp. 276—277, K. Pearson has given general expressions 
for both Mean s and a, for samples from any population in terms of the first six 
moment coefficients of that population. On making use of the appropriate relations 
between the moment coefficients of Type III curves, it will be found that his 
equations ( R ) and (IF) correspond exactly with Craigs expressions for d x and da 
in (17) as far as the terms in 1/A 7 ® That is to say Craig has provided an additional 
term in these two expressions; the point at which this term can be neglected may 
be seen by examining Table I. 
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TABLE IV. 

Standard Error of Standard Deviation . (^2 = ov) 

Population 



Craig 

Approx, A 

Approx. B 

Craig 

Approx. A 

Approx. B 

5 

— 

•3578 

•4183 

— 

•3899 

•4010 

10 

- 

•2745 

*2958 

— 

•3009 

■3200 

20 

•2000(7?) 

•2017 

•2092 

•2197 (?) 

•2217 

■2305 


250 -0588 *0590 
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TABLE V. 

The Coefficients B x and Bt of the Sampling Distribution of the Standard Deviation. 




Pi-- 

= 0 

Craig (17) 

Iiiomet. 
Vol. x. 



ft = 0*5 

Pi = 1-0 

A *1-5 

Craig 

Craig 

Craig 


ft, = 3 + d 4 W 


Population 





•1C1 (??) 


076 (?) -103 (?) 

33 046 



50 

3 

100 

3 

250 

3 



3-089 

3-23 (?) 

3-46 (??) 

3-69 (??) 

3*044 

3*111 

3-22 (?) 

3-33 (?) 

3017 

3-044 

3-086 

3125 
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Tice Coefficients B x and B 2 for the Standard Deviation , Table V. 

(B^df/df, B^S + dt/df). 

For the normal population the values of B x agree closely with those given in 
Biometrika , Vol. x., but as the single term in d 4 contains the factor a d which is zero 
in this case, the values for B 2 obtained from (17) are all equal to 3. The con¬ 
vergence of the expression for d z is not satisfactory except for large samples, and 
therefore the values of B x can hardly be treated as accurate until we have reached 
a stage where for most purposes they could be taken as zero. There is no check 
on the degree of convergence of d 4i so that no great reliance must be placed on 
the differences between B 2 and 3. The value of the formulae seems therefore to 
lie in giving a rough appreciation of how soon the distribution of standard deviations 
may be taken as normal. 

The Mean >Jf3 x (h x ). Table VI. 

For samples from a normal or any symmetrical population this is zero. The 
quantity given in the first section of Table VI is the ratio of the Mean Sample V j3 x 
to the Population Vj8i or b x jc r 3 in Craig’s notation ; it tends to unity as N increases 
but not as quickly as the corresponding ratio for Mean s shown in Table III. For 
a given N the ratio changes only very slowly with increasing skewness. 

The Standard Error of y/fi x (er^ = y/b 2 ). Table VI. 

An expression for the standard error of fi x has long been used* ; it is however 
only the first order term in an expansion and vanishes if the population be 
symmetrical. For this reason, when dealing with normal populations, the standard 
error of VA has been employed; this to the first order is V 6/N\ Formula (18) 
provides in addition a second order term, namely for a normal population it gives 



The relative magnitude of these two terms for different values of N will be 
seen in Table II. The formula is also valuable in giving the standard error of \/ft x 
for skew Type III populations, provided the population be largo enough. 

The Coefficient B x for the distribution of *//3 x (B x = 6g a /feg 8 )« Table VI. 

As the expression for b z in (18) contains only one term, there is no means of 
judging at what point it becomes adequate, but clearly the convergence will be 
less satisfactory than for b 2 . The question-marks added to the figures in the third 
section of Table VI have therefore been somewhat arbitrarily assigned. It seems 
probable however that there is considerably greater skewness in the distribution 
of VA than in that of 8 . 

Dr Craig has referred at the end of his paper to the inaccuracy that would be 
involved by inserting into the formulae for the semi-invariants high powers of cr 8 

* Phil. Tram. A, Vol. 198, 1902, p. 278. Numerical values are given in Tablet for Statisticians and 
Biometriciam. 
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TABLE VI. 

Frequency Constants for Sampling Distribution of Vft. 


(Mean sample Jp } )/(Population v /ft) = b x /a* 


Standard Error of 


Sample / 3 i= 0 ' 2 ^= 0 ' 5 A-*® ft=l-5 po “ n ^=0-2 ft=0'5 ft = 1-0 ft=1-8 


50 *8732 (?) *8711 (?) *8696(??) '8707 (??) '32 (?) *33(??) 


100 -9329 


5 -9678 '9651 *153 


•9860 '9850 *9835 -9820 


■9929 

'9925 

i 

'9917 

•9909 


Size of Normal 
Sample Population 


ft=0-2 ft=05 ft = l*0 


'89(1?) 


*26 (?) ’94 (??) 


1 *06 (?) 

2*00 (?) 

'456 

*81 (?) 
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determined from the sample . In practice however the difficulty is perhaps not so 
serious as he believes, for an exact appreciation of the higher moments of the 
population is not really necessary. The value of this work of his and of similar 
research lies mainly in the light thrown on the manner in which the distribution 
of frequency constants in samples varies with changes in population form. Standard 
errors are associated with a population and not a sample, and it will be found that 
the answer to many statistical problems must be obtained not by assigning a 
standard error to the sample constants, but by considering the following question. 
Is it or is it not likely that the observed sample could have been drawn from a 
population of a certain specified form ? If, for example, we believe that a sample 
with a given V#i comes from some Type III population, we do not need to assign 
a standard error to this y^?i> but rather to find out two limiting population para¬ 
meters o 3 and a 3 ', the one above and the other below V&, such that 

~ V& = k<Tj Px and V/3i - = k'cr'^, 

the two standard errors being associated with a 3 and a 3 ' respectively*. That is to 
say we insert into the formula for b 2 of (18) not a sample value of or 3 =*\/£i> but 
certain hypothetical population values. 

* The values given to k and k f will depend on the probability limit which is chosen, and on the 
degree of skewness in the sampling distribution of . Certainly this latter is difficult to ascertain, 
and we may often have to be content with putting k~k f = 8 , say. 
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1. Introduction. 

The first attempt to obtain a prediction of stature from measurements of the 
long bones on the basis of the correlational calculus was made by Pearson in 1898f. 
The particular problem that Pearson set himself to solve at that time was that of 
reconstructing the stature of prehistoric races. The only data then available for 
calculating the necessary coefficients of correlation between human stature and 
long bones, and between the various long bones themselves, were those provided 
by Rollet in his work De la Mensuration des Os longs des Membres ) Lyons, 1889, 
and it was from the means, standard deviations, and correlation coefficients of this 
French material that Pearson’s original stature regression formulae were derived. 
The question of applying formulae of one local race to another was carefully con¬ 
sidered. Pearson in the course of his discussion of this question in the original 
paper states that “ the extension of the stature regression formulae from one local 
race—say, modern French—to other races—say, palaeolithic man—must be made 
with very great caution,” nevertheless such extension was deemed theoretically 
permissible on the assumptions (1) that stature represents an indirectly selected 
racial character, and (2) that whereas regression formulae in general might be 
expected to change from one race to another yet certain of these, viz. regression 
formulae of indirectly on directly selected characters, should not change; while in 
the case of others, where mere size is the chief factor involved, the differences 
encountered should be only of the second or third order of small quantities 
These conclusions seemed to find practical justification in the trial application of 

* Formerly Peking Union Medical College, Peking, China. 

t “On the Reconstruction of the Stature of Prehistoric Races,” Phil. Trans. A, Yol. 192, 
pp. 169-244. 

t Loc. cit . p. 177. 








304 


Paul Huston Stevenson 


the regression formulae based on the French data to the reconstruction of stature 
from the long bones in the case of such a widely separated racial group as the 
Aino. 

The accumulation of comparable osteometric data in the case of a second and 
distinct contemporaneous racial group, the Chinese, affords an opportunity of 
testing directly the validity of applying regression formulae derived from one 
branch of the human race to another. Without anticipating in too great a degree 
the conclusions arrived at below, it may be stated that the results of such a test do 
not sustain the early confidence in the general applicability of the particular regres¬ 
sion formulae in question to all racial groups. While such a conclusion may call 
for a general reconsideration of the premises upon which the original assumptions 
were made, a more immediate implication lies in the desirability of working-out 
specific regression formulae for different racial groups as soon as the necessary 
data become available *. Such racially specific formulae should not only better 
serve the practical end of more reliable predictions in the case of the groups 
for which they have been derived but should also, in their differences, provide 
a means of studying the direction and degree of organic differentiation among the 
racial groups thus studied. 

The present paper presents stature regression formulae specifically derived 
from Chinese data and compares these with Pearson’s original stature regression 
formulae based on Rollet’s French data. Use is made also of certain available 
constants for Aino and Naqada skeletons for a slightly wider comparison of racial 
variations in general. The writer gratefully acknowledges his personal indebtedness 
to Professor Pearson for his stimulating interest in the problem under consideration, 
and for his generous permission to make use of certain preliminary notes made by 
him on the same subject. 

2. Data and Treatment . 

The data necessary for the calculation of stature regression formulae in the 
case of the Chinese are to be found in the osteometric records based on the 
collection of Chinese osteological material in the Department of Anatomy of the 
Peiping Union Medical College of Peiping. The actual measurements were made 
by Drs Gerhard von Bonin and M. T. Pan. The writer assumes full responsibility 
for the calculation of the various constants, correlation coefficients and’ regression 
formulae based upon these measurements. Lack of adequate female skeletal 
material confines the discussion to males alone, and the material is further re¬ 
stricted to representatives of the North China population. 

* In this connection Professor Pearson Btates that be prepared many years ago a schedule for taking 
cadaver and long bone lengths in the post-mortem room, and sent it to a number of anatomists. 
It produced nothing at aU in England. In Strasburg Gustav Schwalbe promised aid, and shortly before 
the Great War measurements had been made on 80 male and some 40 female subjects. Schwalbe died 
during the war, and Professor Pearson has been unable to find anyone who knows what has become of 
the material. The data which were in England in 1918 or 1914 were returned to Sohwalbe as he thought 
he would be able ultimately to oomplete 100 cadavers of each sex. The rediscovery and reduction of 
these valuable data would be of great importance. 
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All measurements on the Chinese material were taken on well-macerated bones, 
with cartilage removed and after the lapse of ample time to ensure thorough drying. 
Maximum length measurements only are used, those of the tibia were made without 
the spine but including the malleolus. To render the Chinese formulae comparable in 
all respects to Pearson’s formulae for the French the measurements of the right 
bones only are used. Rigid rejection and selection in the interests of assured 
normality still further restricted the Chinese material finally used to forty-eight 
skeletons. The constants for the French, Aino and Naqada tabled together with 
those for the Chinese are taken in the first two instances from the original paper 
by Pearson ( loo, cib.) and in the last from the report of Warren on the Naqada race 0 . 

The theory of regression undoubtedly affords the most reliable method of 
determining the best prediction of stature from giverf measurement of the long 
bones. Thus if X 0i &n be the mean values of the stature and the length of one of 
the long bones respectively as calculated from a sample of the population in 
question, likewise <r 0 , cr n their standard deviations and the coefficient of 
correlation between them, then the most probable stature X 0 to be predicted from 
a particular bone length x n is given by the simple regression formula 


! ron ~ (®» ■ 


®n). 


•(i)> 


*0 


X n 


Xq — Xq 

which may bo transformed into 

X 0 - (%o-ron~ ffn) + 

= Ci *f 

where and c% are constants specifically derived from the sampled values of the 
two variates under consideration. The probable error of this determination is 
*67449<r 0 Vl — ro» a if the stature prediction is made for a single individual, or this 
amount divided by Vn if the x n used in the determination represents the mean of 
measurements on n individuals, as would be the case if the prediction of the most 
probable stature of a group or race is desired and the mean value of measurements 
on n individuals is available for the prediction. 

For the prediction of stature on the basis of given measurements of more than 
one bone, recourse must be had to the multiple regression formula. The most 
convenient working form of this formula is that in which the multiple regression 
coefficients are expressed in the form of determinants, viz. 


#1) + (#a — &*) + . • • 4 —— (®» * 

<r% cr n 


•*»)}■ 


•(«). 


*•-*>—£&<*■ 
where x 1 ...x n represent the respective long bones, and Aoo*.*Aon the minors of 
the determinant 
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* Phil. Tram. B, Vol. 89, pp. 185—928. 
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in which ?’ 01 ... r 0;l represent the various correlations between stature and the 
different long bones, and r 12 ...r n _i n the correlations between the bones them¬ 
selves. 

Substitution of the values of the variable factors concerned in formula (ii) 
transforms the equation into the form 

X 0 * Ci *4“ c 2 # 1 4“ <r 2 .. • Cn+i x n , 

where the stature is again expressed in terms of a linear function of the various 

long bone measurements. Formula (ii) has a probable error of *6744900 

with a similar provision for reduction by Vn in the case of the determination being 
based upon measurements of more than one individual. 

3. Comparisons of Data involved in Stature Regression Formulae . 

It is quite obvious from a consideration of their derivation as shown above, 
that the constants C\ and c 2 in the final regression formulae are functions of the 
respective racial means, standard deviations and coefficients of correlation. Wo 
will commence therefore with a comparison of these variates in the case of the 
French and Chinese (Table I). Flanking these values for the French and Chinese 
are those also of the Naqada and Aino, providing a wider basis of comparison of 
the general interracial differences in the characters concerned. 

The first thing to attract attention in the above table is the significant 
difference in cadaver length in the case of the French and Chinese. Lest the 
figure for the latter be suspected in error it may be mentioned that all the 
cadavers in the series in question are of Northern Chinese, whose mean living 
stature has been adequately determined as 168*830 cms.* In regard to the bone 
lengths of the French and Chinese only the humerus shows a difference that 
proves statistically significant, yet the consistently smaller value for the Chinese 
limb bones is particularly worthy of attention in the light of the reverse total 
body length difference noted above. The Naqada and Aino on different ends of 
the size scale show very marked differences in limb bone lengths. When the four 
races are grouped together as representing Mediterranean and Oriental racial 
stocks respectively a racial differentiation on the basis of shorter limb segment 
lengths in the case of the latter seems to be clearly indicated. 

As regards variability the table shows that the Chinese are both absolutely 
(cf. standard deviations) and relatively (cf. coefficients of variation) significantly 
less variable than the French. While the French and Naqada variabilities are. not 
individually significantly different, the ancient Naqada considered as a whole show 

* There are well-demonstrated total size differences between the Chinese of different regions of 
China—those of Central and Sonth China having mean statures of 165*1 and 168*0 ems. respectively as 
compared with the considerably higher stature of 168*8 cms. of the North. (This latter figure is based on 
unpublished data on eleven hundred individuals.) Cf. P. H. Stevenson, “ Collected Anthropometric data 
on the Chinese,'* China Medical Journal , 1925, Vol. xxxix. pp. 855—898. 
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greater variability than the French. The Aino tends to agree in the main with 
the French and Naqada in being somewhat more variable than the Chinese. We 
may summarise the figures shown in this table by saying that the limb bones of 
the Naqada and French races are longer than those of the Chinese and Aino. 
and the Aino agrees with the two former races in exhibiting a greater variability 
than the Chinese. The possible influence of these differences on the regression 
formulae will be discussed later. 


TABLE I. 

Stature ( Cadaver) and Dry Long Bone Measurements. 

(Absolute measurements in cms.) 


Means: 

Stature 

Femur 

Tibia 

Humerus 

Radius 

Naqada 

French (0 

Chinese 

Aino( 6 ) 

45-930 + 0-170 
37-970 ±0-137 
32-618±0-146 
25-697 + 0-127 

166*260±0*525 
( rf ) 44*578 ±0*226 
36*336 + 0*172 
32*600 + 0*147 
24*174±0*112 

168 "923 ±0-528 
43-975+0-174 
36-248 ±0149 

31 073±0-115 
23-779 ±0-096 

157*900< e ) 
40*770±0*193 
33*895+ 0 183 
29*502 + 0*135 
22*913±0*121 

Standard De 

Stature 

Femur 

Tibia 

llumerus 

Radius 

viations: 

2*519 + 0-134 
1*877+0*097 
1*701 ±0103 
1*290 + 0*090 

5*502 + 0*371 
2*372 + 0*160 
1*799±0*121 

1 *538 ±0*104 
1*170±0*079 

5 *424 ±0*373 
1*788 ±0*123 
1*535 ±0*106 
1*182 ±0*081 
0*987 ±0*068 

1*898 ±0*136 
1*668 ±0*129 
1*343 + 0*095 
1*117±0*086 

Coefficients oj 

Stature 

Femur 

Tibia 

Humerus 

Radius 

f Variation: 

5*484 ±0*293 
4*943 ±0*256 
5*216±0*317 
5*021+0*350 

3 *309 ±0*223 
5*425 + 0*354 
4*888 ±0*330 
4*659 ±0*314 
4*796 + 0*323 

3*211 ±0*221 

4 *066 ±0*280 

4 *234 ±0*291 

3 *802 ±0-262 
4*160 ±0*286 

4*655 + 0*336 
4*921 ±0*381 
4*552 ±0*322 

4 *875 ±0*375 


(а) Taken from Warron, Phil, Trans. B, Vol. 189, 1897, pp. 185—-191. 

(б) Taken from Pearson and Lee, R. 8. Proc . Vol. 61, 1897, p. 847 and Phil. Trans . A, Vol. 192, 
pp. 169 et seq . 

(c) This measurement of the Aino represents the cadaver length as estimated on the basis of a living 
stature of 156*7 cms. plus 1*2 cms., which is a proportional estimate of the amonnt figured by Pearson 
(loc. cit. p. 191) as the difference between living stature and corpse length. [The living stature given here 
is for the Yezo and Saohalin Aino, not for the Shikotan Aino.— Ed.] 

(d) The mean lengths given here for the French long bones are not based on the data given originally 
by Rollet but represent the mean lengths found by Pearson after the subtractions calculated by him 
(loc. cit . p. 195) as necessary to reduoe the lengths of freshly autopsied bones with cartilages to the 
corresponding lengths of dry bones without cartilage and free of animal matter. All the measurements 
of long bones in this table are therefore strictly comparable. 


21—2 
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TABLE II. 

Stature ( Cadaver ) and Long Bone Correlations (Males) {a) . 


Race 

Stature 

Femur 

Tibia 

Humerus 

Radius 


( Naqada 

_ 

-. 

_ 



g 

I French 

1 

•8105 ±*0327 

•7769+*0378 

•8091 ± *0329 

•69561*0492 

-4-3 

$ 

j Chinese 

l 

•8036 ± ’0344 

*8563 ±‘0260 

*6128 ±‘0608 

•6801 ± -0523 

m 

[Aino 

— 

— 

— 


— 


( Naqada 

_ 

1 

•9164 ±*0115 

*8416 ±*0248 

•84651-0295 

3 

Q 

I French 

•81051*0327 

1 

•8068 ± 0336 

•8421 ± -0277 

•7439+ *0426 

1 Chinese 

•8036 ±*0344 

1 

•8904± *0202 

•7377 ±-0443 

•6696+ -0536 


\Aino 

***“ 

1 

•8266 ± *0338 

•8584 ± *0274 

•7891 + -0418 

# c8 

f Naqada 


*9164± *0115 

1 

•8497+-0218 

•8605+-0247 

1 French 

•7769 ±'0378 

•8068 ±-0335 

1 

•8601 ± -0248 

•78041-0373 

EH 

1 Chinese 

•8663 ±-0260 

•8904+-0202 

1 

•6866+-0514 

•7642 1 0406 


\Aino 

— 

•8266 ±-0338 

1 

■7447 ±-0481 

•86551 -0286 

. 

fNaqada 

_ 

•8416 ± -0248 

•8497 ±-0218 

1 

*82321*0308 

a> 

J French 

•8091 + -0329 

•8421 ± -0277 

•8601 + -0248 

1 

*8451 + *0273 

B 

1 Chinese 

•6128 ±’0608 

•7377 ±'0443 

•6866 ±-0614 

1 

•67151*0534 

w 

[Aino 

— 

•8584 ±-0274 

•7447 ±-0481 

1 

•77631-0429 

1 

( Naqada 

. 

•8465 ± *0295 

•8505 +-0247 

•8232±-0308 

1 

J French 

•6956 ± 0492 

•7439 ±*0426 

•7804 +*0373 

•8461 ± -0273 

1 

£ 

1 Chinese 

•6801 ± -0523 

*6696 ± -0536 

•7642+-0405 

•6715± -0534 

1 

l^Aino 


•7891 ± *0418 

•8655 ± *0286 

•7763+-0429 

1 


(a) French, Aino and Naqada values taken from Pearson, Lee and Warren as cited under Table I. 


We turn next to the correlations; these are given in Table II. It is to be 
noted at the outset that owing to the smallness of the members in the various 
series* the probable errors of the coefficients run high notwithstanding thp high 
values of the correlations. Care must be exercised therefore in attributing significance 
to the differences observed. Of the correlations between stature and various long 
bones in the French and Chinese, only in the case of that between the stature and 
humerus do we find a difference approaching statistical significance, the difference 
here being 2*9 times its probable error. A peculiar fact to be noted in connection 
with the stature long bone correlations in these two races is that whereas in the 
Chinese it is the distal limb segments that show the highest correlation with 
stature, the reverse is true in the case of the French. 

* The number of individuals figuring in the above correlations are Chinese 48, French 60, Aino 
varying from 82 to 89, Naqada varying from 24 to 88. 
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Turning to the correlations between the various long bones themselves we note 
that with the exception of that between femur and tibia all the Chinese correlations 
are smaller than the French. In addition to its low correlation with stature noted 
above the Chinese humerus seems to be significantly low in its correlations with 
other long bones also. Glancing at the correlations on the part of the Naqada and 
Aino groups we note that the Aino, except in the single case of the correlation 
between the humerus and tibia, is in closer agreement with the French and 
Naqada than with the Chinese. These correlation differences are not great how¬ 
ever, and until confirmed (or otherwise) by data on other allied races should not 
have too great importance attached to them. 

The remaining constants necessary to complete the formulae are the combined 
lengths of femur plus tibia and humerus plus radius. These are: 


Femur plus Tibia 

Mean 

S.D. 

Correlation 

Chinese . 

80*223 

3*231 

*8515 

French . 

80-848 

3*979 

*8384 

Humerus plus Radius 

Chinese . 

54*852 

1*984 

•7032 

French . 

56*738 

2-536 

*7973 


Having noted the racial differences in size, variability and correlations in the 
Chinese and French statures and long bones we are now ready to study the 
resultant differences in the respective regression formulae derived from them. 
Substituting in the generalised regression formulae (i), (ii) the appropriate racial 
means, standard deviations and correlation coefficients, we obtain the two series of 
regression formulae tabled together in Table IIL The first constant in the French 
formulae has been recalculated to provide in each case the cadaver length (rather 
than the living stature) prediction*. The two sets of formulae are therefore strictly 
comparable in all respects. 

There is no need to pause over the individual differences in the corresponding 
constants in these regression formulae, such differences being expected in the 
nature of the case. Glancing at the probable errors it is to be noted that while in 

* The French formulae as given here differ sUghtly therefore from either of the two aeries derived by 
Pearson. His first series (ioc. tit. Table VII) were “ Formulae for the Reconstruction of the Stature as 
Corpse, the Maximum Lengths of the F., H., B. and T. without Spine being measured with the Cartilage 
on and in a Hamid State”; his second (Table XIV), “Living Stature from Dead Long Bones.” The 
French formulae as recalculated for this paper are for Cadaver Length from Dead (Dry) Long Bones, 
the same as in the case of the Chinese formulae. [The “ recalculation ” should consist in adding 1*26 cms. 
to Pearson’s formulae for stature prediction (ioc, tit . pp. 187 and 191). I do not understand how 
Dr Stevenson reaches a fourth decimal place in the,constant terms, Pearson having only three, or why 
his values of the oonstant terms are far from agreeing with Pearson’s constants plus 1*26 cms* Ed.] 
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the case of the French the best single bone prediction is to be had from the 
femur (a), in the case of the Chinese the best single bone prediction is to be had 
from the tibia (c). Further, whereas in the French the combination of the tibia 
with the femur, either by adding the two lengths together (e) or by using a 
bivariate formula (/), materially improves the prediction, in the Chinese no 
appreciable gain in accuracy is to be obtained through combining any other bone 
with the tibia. In the French the excellent prediction on the basis of the bivariate 
formula (i) is hardly improved upon by using all the bones ( k ). In the Chinese 
the best prediction of all is based on all the bones (&). 

TABLE III. 


Prediction Equations for Cadaver Lengths from Dry Long Bones . 


From Chinese Data 

From French Data 

(a) <7= 61 -7207 + 2"4378 F ±2-1766 

(b) =81-6115 + 2-8131 ff +2-8903 

(e) = 59-2266 + 3-0263 T +1 -8916 

(d) =80-0276 + 3-7384 ft +2-6791 

(e) — 54-2522 +1 "4294 (F+T) +1-9214 

( f) =65-3866+0-6024 /’+2-4014 T+ 1 -8605 
(q) = 63-4865 +1 9222 (ff+ It) ±2-0529 

(A) = 64-4272 +1 -3052 77+2 -6889 ft±2-5691 
(?) =59-7828 + 2-3397 F+ 0-2012 //±2‘1747 
(j) = 57-1954 + 2-8594 T +0-3398 ft +1 -8838 

(t) =52-2596+0-6640 /’+2-2065 T 

- 0-1008 U +0 -4464 It +1 -8201 

(a) C=82-5661 + 1 •880/’ ±2‘174 

(A) =71-9156 + 2"894 H ±2-181 

(c) =79-9257+ 2-376 T ±2-337 

(d) =87-1868 + 3-271 ft ±2 666 

(e) = 72-5572 + 1 ’159 (F+T) ±2-023 

( f) =72 -7062 +1 -220 F +1 -080 T ± 2 -030 

( q) =68-1033 +1-730 (ft+ft) ±2-240 

(/i) =71-2767 + 2-769//+0-195 ft ±2-179 

(i) =69-6483 + 1 •030/’+! -557 //+1-962 

(j) *=73-7659 + 1-831 T + 1-074 ft + 2-276 

<}) =68-3990 +0-913/’ + 0-600r 

+1 -226 H- 0-187 ft ± 1 -961 


4. Application of Formulas of one Race to another . 

We pass now to the question of applying formulae specifically derived from 
data of one race to prediction in the case of a second. The first test of such 
application is found in Table IV. The probable errors of the predictions herein 
tabulated are for 48 and 50 individuals respectively. It is to be seen at a glance 
that the average Chinese stature prediction from French formulae is over 4 cms. 
too small, and that of French stature from Chinese formulae about the same 
amount too large. These differences are of an order of more than seven times 
their own probable errors, and are quite obviously too great to justify the application 
of French formulae to Chinese or vice versa. In fact such a difference indicates 
a statistical improbability of the order of several millions to one that the formulae 
of one of these two races will provide a satisfactory prediction of the stature of an 
individual belonging to the other. 

It will be of interest to review briefly at this point the various racial factors 
entering into the derivation of these formulae, with the purpose in mind of 
determining which are responsible for the failure of the formulae of one of the 
two races in question to provide suitable predictions in the case of the other. 

[ # This formula seems to me erroneous; it has been worked out for eadaver length and humid bones 
with cartilages attached. I think the constant term should be 74*864. Ed.] 
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In this connection we must take note of differences in absolute.size, absolute and 
relative variabilities, correlations between the factors involved, and the racial 
differences in stature-limb proportions. We will briefly discuss these points, as 
far as our data permits, in the reverse order of that just named. 


TABLE IV. 

Trial Reconstruction of Stature (Cadaver Length). 


Formula 

Chinese from French 

French from Chinese 

True Value 

168*923± *628 

166 *260 ±*525 

(«) 

165*239 ±'314 

170*247 ±‘307 

(6) 

161 *841 ±*318 

173*218 ±*408 

(«) 

166*051 ± *307 

169 *190 ±*267 

(d) 

164*968 ±*384 

170*400+*378 

165*536 ±*291 

169*825 ±*271 

( 7 ) 

165*502+-293 

169*461 ±*263 

(9) 

(A) 

162 *997 ±*323 

172 *617 ±*375 

161*955+ *314 

171*978 ±*363 

(i) 

(/) 

163*323 ±*283 

170*501+*307 

165*675 ± *328* 

169*309 ±*266 

(*) 

163 *915 ±*283 

169*500±*257 

Moan (a) — (k) 

164 *273 ±*312 

170*5681*315 


(a) Influence of Differences in Stature-Limb Proportions. With shorter limb 
bones the Northern Chinese are nevertheless taller than the French. The question 
immediately arises as to the possibility of judging in advance as to the applicability 
(or otherwise) of formulae of one race to a second on the basis of similarity or 
dissimilarity of the sitting height-stature index f. At first sight this suggestion 
seems reasonable on the grounds that the sitting height-stature index, by definitely 
indicating the total lower limb-stature proportion, might be expected to provide 
an indication of stature-limb proportions in general. The natural assumption would 
be that the greater the difference between two races in the sitting height-stature 
index the less likely would the formulae of either race prove applicable to 
the other. 

To test this assumption it is necessary to have recourse to stature reconstruction 
data on a third race, as well as the sitting height-stature indices of all three races 
under consideration. The racial variables given for the Aino prove useful for this 

[* This value should, I think, be 166*778, raising the mean to 164*888. Ed.] 
t Professor Pearson in commenting on the proportionately shorter limb bones of the Chinese says : 
“Their vertebral columns must be relatively longer, and accordingly their index: 100 Bitting height/ 
standing height, should differ very sensibly from that of the French. Before applying our French 
reconstruction formulae to a second race, it would certainly be wise, where it is possible, to test whether 
the above index is approximately the same for the two raoes.” 













312 


Paul Huston Stevenson 


purpose. Tablo Y presents the parallel series of Aino stature reconstructions on 
the bases of the French and Chinese formulae respectively, and Table VI contains 
the figures necessary for an estimate of the influence of sitting height-stature 
index on the applicability of the Chinese and French regression formulae to each 
of the three races. The sitting height-stature indices for the French and Aino are 
taken from Martin # , that for the Chinese from the writer’s unpublished data 
on over one thousand Chinese. 


TABLE V. 

Aino Stature Predictions. 


Formulae 

French 

Chinese 

(a) 

159-214 

16M10 

(6) 

157-294 

164*503 

(c) 

160-460 

161*802 

(d) 

162*135 

165-685 

(€) 

159*094 

160*978 

(/) 

159*051 

161*342 

iff) 

158*781 

164*239 

(A) 

157*436 

164*544 

(0 , 

157*576 

161*108 

6) 

160*436 

161*901 

<*> 

157*814 

161*375 

Mean (a) —( k) 

159*026 

162*599 


TABLE VI. 

Influence of Sitting Height-Stature Index . 


Race 

Sitting Height- 
Stature Index 

Actual 
Stature f 

Predicted Statures from 

Ch. Formula 

Diff. 

Fr. Formula 

Diff. 

French 

51*9 

166*260 

170-668 

E1 



Chinese 

53*9 

168*923 

— 

M 

164*273 

4-650 

Aino 

64-8J 

157*900 

162*599 

4*699 

159*026 

1-126 


The results shown in Table VI are interesting and not a little surprising. 
The sitting height-stature index of the Aino, as is well known, is very high. This 
race in fact represents one extreme in trunk-limb proportion. The index for the 
Chinese is much nearer the Aino than either of these is to the French. On the 

* Rudolf Martin, Lehrbuch der Anthropologic, 2nd edition, Vol. x. p. 889. Jena, Gustav Fischer, 1928. 

t Cadaver length. 

[t I do not Bee why the Shikotan Aino should be eited by Martin, nor used by Dr Stevenson; Koganei, 
who took the measurements, considers this value exceptional, and gives for the Yezo Aino 52*4 and for 
hiB general Aino mean 52*7, which place the Aino nearer to the French than to the Chinese* Ed.] 
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assumption stated above the Chinese formulae should give a much better prediction 
of Aino stature than the French formulae. The fact is, however, that the pre¬ 
diction of Aino stature by use of the French formulae, in spite of the wide 
divergence in sitting height index, is very much better than that given by 
the Chinese. 

It may be objected with reason that the index in question is concerned only 
with relative proportions of the lower limbs to the stature. Using the formulae 
based on lower limb segments alone might therefore be expected to give a better 
Aino stature prediction from the Chinese than from the French formulae. But 
Chinese formulae (a), (c), ( e) and (/), in which the lower limb segments alone 
are involved, give an average prediction of Aino stature of 161*308 as against 
169*454 cms. given by the same formulae of the French; the actual stature being 
157*90 cms. Clearly we must conclude that the sitting height-stature formula 
fails‘singularly as a criterion of the applicability of racial formula from one race 
to another. 


TABLE VII. 

Comparison of Separate Limb Segment-Stature Ratios . 



Femur 

Humerus 

Tibia 

Badius 

French 

•2678 

•1961 

•2185 

•1464 

Chinese 

•2603 

•1839 

•2146 

•1468 

Aino 

•2682 

•1868 

•2147 

•1461 


But there are other much more critical limb proportion tests than the one just 
used. Table VII for instance gives a comparison of the separate limb segment to 
stature ratios in the three races under consideration. Here it is to be noted that 
with the single exception of the radius the Aino and Chinese are again nearer to 
each other than either is to the French. Excluding the formulae containing the 
radius should therefore, if limb proportions play an important rfile in determining 
the applicability of formulae of one race to another, provide a better prediction 
of Aino stature from the Chinese than from the French formulae. The average 
Aino statures predicted from formulae without the radius, however, are from the 
French formulae 158*781 cms., and from the Chinese 161*807 cms. The French 
formulae still give the better prediction. 

The results just noted suggest a still further step in limb proportion analysis. 
Table VIII gives a comparison of the stature and individual limb segment ratios 
of the Aino and Chinese, each with respect to the French. The total size differences 
between the two races obscure the comparison in the first two columns, but when 
the Aino ratios are adjusted to what they would be were the statures of the two 
races the same—i.e. multiplying each by the factor necessary to equalise the 
statures, as is done in the last column of the table—then the relative ratios of 
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the Aino find Chinese limb segments to the corresponding French limb segments 
are indicated clearly. The results are the same as those noted above, namely, 
except in the case of the radius, the Aino and Chinese limb segments bear 
practically the same proportions to the French. The Chinese should, on this basis, 
have as good prediction values from the French formulae as the Aino. 


TABLE VIII. 

Stature and Limb Segment Ratios of Aino and Chinese respectively to French . 



Aino 

Chinese 

Aino (a)* 

Stature 

•9497 

1-0160 

1*0160 

Femur 

•9158 

•9878 

•9797 

Humerus ... 

•9050 

*9532 

•9681 

Tibia 

•9329 

•9976 

•9980 

Radius 

*9478 

•9837 

1-1040 


The evidence just presented leads us to the conclusion therefore that' racial 
similarities or differences in limb segment and stature proportions, or limb segment 
proportions per se , are not the chief factors concerned in the question of the 
satisfactory application of stature reconstruction formulae from one race to another. 

(b) Influence of Size , Variability and Correlation. It is not difficult to show 
that the values of the different constants in the various regression formulae depend 
upon the means, standard deviations and correlation coefficients of the racial 
characters involved. The following form of the regression equation— 

\ &B / c B 

= <h + c 2 5, 

where S and B represent the stature and one of the long bones respectively, and 
M 8 and M Bi <r s and a Bi and r SB their means, standard deviations and correlation 
coefficients—expresses clearly this intimate relation between the ultimate values 
of the constants c x and c a and the various racial variables. Thus the first constant 
(c x ) is seen to be a function of all five variables, while the second (cs) is the 

regression coefficient itself Although it is possible to formulate a 

mathematical proof of the influence of single or associated deviations in the case 
of each of these racial variables on the derived constants, yet the empirical results 
given in Table IX will suffice to demonstrate the general character of the resultant 
changes brought about in the constants by the indicated changes in the respective 
variables. 


Adjusted to Chinese stature equivalents. 














TABLE IX. 

Influence of Variations in Racial Size* Variability and Correlations 

on Regression Constants. 

A. Variations in Sizb. 


(cl) Of stature. 


Stature 

Long Bone 


Equation Constants 


Mean 

S.D. 

Mean 

S.D. 

Correlation 


«a 

Cl 

160-00 

5-00 . 

45-00 

2-00 

■80 

70-00 

2-00 

CL 

165-00 


99 



75-00 


a 

170-00 


99 

99 

99 

80-00 

** 

(b) Of long bone. 

b 

165*00 

99 

43-64 



77-72 


b 

77 

99 

46 00 

99 

99 

75-00 

99 

b 

99 

99 

46-36 

99 

99 

72*28 

99 

( c ) Of both stature and long bone, in 

same direction. 



c 

160-00 

99 

43-64 



72-72 


c 

165-00 


45-00 


99 

75-00 


c 

170*00 

99 

46-36 


99 

77*28 

99 

(d) Of both stature and long bone. In 

opposite directions. 



d 

160-00 


46-36 



67*28 


d 

165 00 


46 *(X) 


99 

75 00 

99 

d 

170 00 


43*64 

99 

99 

82-72 

99 


15. Variations in Variability. 


(cl) Of stature. 

a 

165*00 

4*00 

45-00 

2-00 

•80 

93*00 

1-60 

tt 


5*00 




75*00 

2-00 

CL 

>7 

6-00 

77 

77 

»7 

67 00 

2-40 

(6) Of long bone. 

b 


5 00 


1-60 


52*50 

2-50 

b 




2-00 


75-00 

2*00 

b 

77 

77 

77 

2-40 

77 

9000 

1-67 

(c) Of both stature and long bone, in 

same direction. 



c 


4-00 


1-60 


75*00 

2*00 

c 


5-00 


2-00 


75 00 

2-00 

c 

77 

6 00 

7* 

2*40 


75*00 

2-00 

(d) Of both stature and long bone, in 

opposite directions. 



d 

9* 

4-00 


2-40 


105-00 

1 -33 

d 

97 

5-00 


2-00 


75 00 

2-00 

d 

99 

6*00 

*7 

1*60 

77 

30-00 

3-00 


C. Variations in Cobrslations. 


a 

166-00 

6*00 

45-00 

2-00 

•70 

86-25 

1*76 

CL 

77 

77 

77 

77 

•80 

75*00 

2*00 

CL 

>7 

77 

77 

77 

•90 

63-76 

2*25 
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If we start with a set of hypothetical racial values—e.g. for stature, let the mean 
be 165*00 cms., s.D. 5*00; for the long bone (femur), mean 45*00, S.D. 2*00; corre¬ 
lation r SB — *80; the regression equation being S = 75*00 + 2*00 F —then Table IX 
shows in the last two columns the changes that occur in the values of the two 
constants of the regression equation incident upon the various changes in the 
racial variables indicated in the sub-headings of the" various sections of the table. 
The centre row of each section repeats the hypothetical values just given; the line 
above this centre row shows the effect of a decrease in the variable concerned, the 
row below the effect of an increase. Variations in size and variability in the case 
of the femur are proportional to the variations arbitrarily chosen for the stature. 
The various ranges of variations chosen for illustration are all of them perfectly 
normal and likely to occur in the light of the racial values given in Tables I 
and II and experience with other racial data. 

A glance at the results given in the last two columns of Table IX (p. 315) shows 
several things. In the first place, as is easily understood from a knowledge of 
the derivation of the constants, variations in size (cf. Section A of the table) 
affect only the first constant. Any variation in the first variable alone, for instance, 
results in a change in Ci corresponding both in amount and in direction, while 
a variation in the second variable alone produces a change in the opposite direction 
in the constant and of an amount determined by the regression coefficient 

• It is in the case of differences in the sizes of both variables and in 

opposite directions, cf. A ( d ), that the greatest change occurs in the constant. It is 
in this latter category, it so happens, that the stature and long bone differences 
fall in the case of our French and Chinese material. 

Differences in variability (cf. Section B of the table) affect both constants 
except when the differences are exactly proportional and in the same direction. 
Furthermore, the resultant changes in the first constant are considerably larger 
than those resulting from differences in size only. The balancing effect of pro¬ 
portional differences in both variables and in the same direction is also to be noted, 
cf. B (c), although this fact is easily explainable by the particular relation these 

variables bear to each other in the derivation of the constants. With respect 

to those of the French the variabilities of our Chinese material may be considered 
as falling into category B (5) of the table. 

The effect of variations in correlation are shown in Section C of the table. 
Both constants are affected, and always in the same direction. 

Although there is nothing in the above table that a knowledge of the theory 
of regression would not have anticipated, yet the results tabulated therein make 
possible a very simple visualisation of the relative influence of common racial 
variations in size, variability and correlation on the two regression formula 
constants. In actual practice, however, matters are rarely as simple as indicated 
in the table, owing to the foot that we usually meet with various combinations of 
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differences in all three of the variables at onoe. Considering the human family at 
large it must be realised that the possibilities of such combinations are practically 
infinite. Judgment in advance of the applicability of regression formulae of one 
race to a second, except in rare cases of practical agreement in respect to all 
factors concerned, would seem to be most difficult, if not impossible. Fortuitous 
combinations of differences in size, variabilities and correlations in the case of 
widely separated races may occasionally provide an equally fortuitous combination 
of regression equation constants that may yield otherwise quite unexplainable 
results in regard to applicability of the regression formulae of one to the other of 
the two races. It is suggested here, though certainly not proven, that such a 
fortuitous combination of circumstances, especially in regard to variability and 
correlations, is operative in the case of the French and the Aino. The urgent 
need of similar regression formulae for a much wider range of racial groups is vital 
to the problem in hand. 

5. Conclusions. 

A series of Chinese stature and long bone regression formulae, based on associated 
measurements of the cadaver length and dry long bone lengths of forty-eight Chinese 
male skeletons, is presented herewith (Table X). These fortnulae, together with the 
racial variables involved in their derivation, are compared with a similar series of 
regression formulae derived by Pearson from French data. 

The validity of applying regression formulae of one racial group to a second is 
tested by the trial application of each of these two sets of formulae to stature 
predictions in the case of the opposite race. An analysis of the factors underlying 
the resulting failure of the formulae of one race to give satisfactory prediction 
results for the second is then attempted. Through the application of each of these 
two sets of formulae to a third race, the Aino, the influence of such factors as 
(a) stature-limb proportions and individual limb segment proportions and (b) varia¬ 
tions in racial size, variabilities and correlations on the respective prediction results iB 
noted. Differences or similarities in stature-limb proportions or the intersegmental 
proportions of the various limb elements seem to play a minor role in determining 
the applicability of racial formulae of one race to another, although this phase 
of the subject requires much moro study than it is possible to give to it at present 
on account of the lack of suitable data. The influence of common racial differences 
in size, variability and correlation, especially of the second of these factors, is noted 
in the case of a tabulated r4sum& of the variations observed in regression constants 
incident upon variations in the underlying racial factors just mentioned. 

Lastly, the urgent need of additional data in the form of similar series of 
regression formulae based on comparable material for other races is strongly 
emphasised. 
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TABLE X. 

Chinese Statures {Cadaver Lengths) and Right Long Bone Measurements. 


Cad. No. 

Stature 

Femur 

Humerus 

Tibia 

Radius 

20 

159*2 

41*7 

30-4 

34*3 

22*3 

77 

159*6 

43*3 

30*6 

35*1 

23*9 

04 

159*7 

41*9 

29*6 

36 0 

22*8 

88 

160*0 

41*7 

29*8 

33-2 

22*5 

14 

161*0 

41*1 

28-5 

34*2 

22*7 

81 

161-6 

43*0 

32-2 

34*7 

22*7 

84 

163 1 

44*0 

30-9 

34-3 

22*3 

23 

163*7 

41*3 

29*8 

33-6 

22*5 

90 

164-7 

43*2 

30-7 

36-9 

23*8 

30 

166-0 

43-8 


34-6 

23-4 

30 

165-0 

43'8 

30-9 

35-0 

23*1 

49 

165-0 

43*5 

30-6 

36-4 

24*4 

83 

165-4 

39*9 

30-2 

34-4 

22-8 

74 

165-0 

43*4 

31*0 

36-2 

23*0 

76 

165-7 

44*1 

30-5 

35*4 

23*4 

28 

166-7 

43*4 

29-5 

36*5 

23*1 

109 

165-9 

42-5 

30-2 

35-3 

24-1 

106 

166-7 

42-6 

31*8 

36-2 

24-6 

46 

1671 

43-4 

30-0 

35-3 

23 0 

99 

168-0 

42-5 


35 1 

22*5 

34 

168-0 


31-4 

35*6 

22-9 

79 

168-2 

41-7 

31 -7 

35 1 

24-8 

95 

108-2 

44-2 

31-4 

37 2 

24-1 

37 

109-0 

42*3 

28-3 

35-6 

22-2 

50 

169-0 

44-2 


36*5 

24-0 

92 

1601 

43*1 

311 

35 0 

22-3 

48 

100-7 

42*8 

29-9 

34*9 

23-2 

18 

170-0 

44-3 

30-4 

36*2 

25-3 

73 

170-1 

45-1 

31*5 

37*2 

24-4 

54 

170-3 

43-6 

311 

37 1 

24-0 

110 

170-5 

45*5 

33-2 

37*5 

24*3 

22 

170-5 

46-7 

32-6 

38-4 

24-5 

29 

170-5 

42-6 

30-6 

35*4 

24-6 

31 

171-0 



37*8 

24-1 

17 

172-0 

44*8 


37*4 

24-5 

114 

172-2 

44-8 

..cSbeSH 

37*5 

23-7 

80 

172-5 

44*7 


36*2 

23-3 

100 

173-5 

44-4 

31*1 

36*4 

23-5 

53 

173-7 

45-2 

31 -6 

37*4 

25-0 

35 

174-0 

45*4 

31 -3 

37*2 

24-2 

72 

174-1 

45-4 

31-0 

37*0 

24*2 

69 

174-7 

44-9 

32-6 

38*3 

24-5 

75 

175-8 


33-0 

38*6 

25-9 

97 

176-5 

45-1 

30-2 

37*9 

23 7 

21 

176-6 

45-4 

32-3 

37*8 

25-7 

102 

177-1 

47*2 

33-2 

39*0 

25-3 

101 

1800 

48*0 

32-7 

39*1 

24-5 

40 

184-0 

48*8 

34-5 

o 

6 

25-8 

Mean 

168*9229 

43-9750 

31-0720 

36*2479 

23*7792 

S.D. 

5*4243 

1*7881 

1*1816 

1*5348 

0*9868 

C. of V. 

3*2111 


3-8024 

4-2343 

4*1499 
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Editorial Note on Dr Stevenson's Paper 

Note. I think there should be some hesitation in accepting all Dr Stevenson's 
conclusions. I am prepared to admit that better results for the regression formulae 
will be obtained by applying the formula peculiar to a race itself than by applying 
a formula arising from a second race. Yet the results of Table IV seem more 
divergent than I should consider possible, and become more remarkable when we 
notice how well the French formulae reconstruct Aino stature. Dr Stevenson tells 
us that the stature of the Northern Chinese (measured on 1100 individuals) was 
168*830 cms. I should expect the cadaver length therefore to be about 170*090 cms. 
The cadaver length of the 48 subjects was 168*923, corresponding roughly to a 
stature length of 167*7. The 1*1 cms. difference in stature between the general 
population of Northern China and the post-mortem room population may be 
possible; such a population is usually not a random sample from a general population. 
On the whole, however, it would be desirable to discover whether the cadaver length 
was measured by Rollet and by the Peiping anatomists in the same manner. If this 
really were so, then it must follow as far as I can see that it is the correlation of the 
vertebral column with the stature which is affecting the differentiation in the 
racial results. The sitting height index would be a sign of racial differences in the 
vertebral column, and I suggested it might possibly be a measure of whether a 
racial formula could reasonably be applied from one race to a second. To this 
Dr Stevenson gives the reply that whereas the French formulae give Aino stature 
with fair approximation, the Chinese formulae do not, although the Aino and 
Chinese sitting height indices are fairly close to each other and divergent from the 
French. To this the answer must be that we do not know adequately the sitting 
height index of the Aino, whose long bones were measured. If they were Shikotan 
Aino with a sitting height index of 58*4, then their average living stature was 
167*9 cms. (not the 156*6 of the Yezo and Sachalin Aino), corresponding to 
159*2 cms. cadaver length, which is astonishingly close to the value 159*0 of the 
French prediction. 

To surmount this difficulty Dr Stevenson introduces his Table IX, in which he 
gives arbitrary values to his stature, long bone lengths, standard deviations and 
correlations and calculates the new regression coefficients; he infers that because 
these quantities are not the same for two different races, it is due to the actual 
means, standard deviations and correlations not being the same. As I have said 
Dr Stevenson gives arbitrary values to these quantities and shows as a result that 
the regression coefficients will be modified. Let us look at this a little more closely. 

Intraracially there exists a high correlation between stature and long bones, 
also between their standard deviations and correlation coefficients; there is also 
correlation between individual standard deviations and between individual corre¬ 
lation coefficients. If the distribution be non-normal there may be correlation even 
between the means themselves and these other constants. It would therefore be 
quite impossible within the race to take a group from the population with the sort 
of changes denoted by those in Dr Stevenson's Table IX, for the odds against their 
coexistence would be in most cases excessive. 
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Now let us turn to interracial data, that is to say to tables based on racial 
means, from which the standard deviations, correlations and regression equa¬ 
tions of racial means are derived. These interracial constants are determinable, 
and have been determined in a certain number of cases. In the matter of stature 
and long bones, the correlations are likely to bo high, the race with short average 
stature will have short average long bones and the tall races will have greater average 
long bones. Further the interracial standard deviations will usually be smaller than 
the intraracial standard deviations. Means have not such a wide range of distribution 
as individuals—the individuals belonging to a race stretch out over a wide range 
which may cover the whole range of interracial means. 

It is accordingly not possible for Dr Stevenson to give arbitrary changes to his 
means, standard deviations and correlations; all these quantities in the interracial 
tables will be correlated and they cannot be measured by the scale of intraracial 
standard deviations, and treated as possible systems of change for other races. 
I feel fairly confident that a race with stature 160*0 cms. and femur length 46*4 cms., 
and one with 170*0 cms. of stature and 43*6 cms. of femur, are so improbable that 
Dr Stevenson will search the world for them in vain. The fact that interracially 
stature and femur have standard deviations of order 5*0 and 2*2 cms. respectively 
does not justify us in attributing changes of this order in combination to racial 
means. 

One further point, let us suppose that we were in possession of a multiple 
regression equation for the group of men from whom by a process of selection we 
believe all races of mankind to have sprung. Let it be 

S = Cq + C\X i + C 2 #jg 4- ... + c n x n , 

where S is the probable stature of the group in this race which has skeletal parts 
represented by x lf ^ 2 , ... x n . Then if these skeletal parts include all those which 
have been directly selected in the course of evolution, the regression equation for 
stature would remain the same in all races, although the means, standard deviations 
and correlations might change in a great variety of ways*. The regression co¬ 
efficients would be unaltered by the selection; in other words they have a stability 
far higher than that of means, standard deviations and correlations. If these 
regresssion coefficients change it must be because some other skeletal parts 
#»+i> ff»+a> ••• x m not included in our formula, but highly correlated with stature, 
have been selected. The divergence therefore between the stature-prediction 
formulae for the French and Chinese must be due to one or more skeletal 
parts, which are highly correlated with stature, having been omitted from the 
formulae. If we consider the parts of the skeleton not taken into consideration, 
and which suggest selection, we naturally turn to the vertebral column as the 
most important. Of course the pelvic and cranial heights might present appreciable 
correlations, but the first subject for study seems to me the vertebral column. At 

* Pearson, 14 On the Inflnenee of Natural Selection on the Variability and Correlation of Organs.” 
Phil Tram . A, Vol. 200,1902, p. 21. 
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present nobody knows the correlations between individual vertebrae, nor the 
correlation between any individual vertebra and the total length of the column. 
It is quite possible that it might not be needful to use all the vertebrae, but that 
the correlation of stature with the height of one or two vertebrae might be nearly 
as efficient as measuring the whole series. The investigation would be well worth 
while making, if the Chinese material extends to measurements on the vertebral 
column. 

Unfortunately Dr Stevenson being in Peiping I cannot talk matters over with 
him and I am uncertain whether his conception of bones 14 free of animal matter" 
quite coincides with the view I had in my memoir of 1898. Still I do not think 
that the corrections he has made in this respect would materially alter the difficulty 
that ray formulae while giving good results for the Aino give bad ones for the 
Chinese, but on the other hand his formulae for the Chinese give bad results for 
both French and Aino. Some light might possibly be thrown on our difficulties 
could we ascertain from hospital data the true relationship of living stature to 
corpse length for the Chinese. It is a case where far more data and far more 
research, especially as to the part played by the vertebral column, are requisite. 

K. P. 


Biometrfk* xxi 
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MEASUREMENTS OF MACEDONIAN MEN. 

By MARGARET M. HASLUCK, B.A., and G. M. MORANT, IXSc. 


I. Introduction, by M. M. Hasluck. 

The measurements analysed in the following paper were made in 1921—3 in 
what Professor Ripley once called* the “practically uuworked” field of South-West 
Macedonia. Its extreme boundaries may be given as the River Vardar (the ancient 
Axios) on the East, the Pindus mountains on the West, Mount Olympus and the 
Thessalo-Macedonian frontier on the South, and Mount Kaimakchalan and the 
Graeco-Serbian frontier on the North. The whole region was transferred from 
Turkish to Greek sovereignty after the Balkan wars of 1912—3, and it has an area 
of some seventy square miles. 

Measurements were made among six different groups of people, who have been 
called respectively Greeks, Ylachs, Christian Bulgars, Mohammedan Bulgars, Turks, 
and Greek-speaking Mohammedans. These names divide the groups into Christians 
and Mohammedans according to their religion, and into Greeks, Vlachs, Bulgars, 
and Turks according to their language. The Rumanian patois spoken by the Vlachs 
(i.e. Wallachians) is meant by the Vlach language. 

The linguistic touchstone which gave these names was the language spoken at 
home by the women. The additional language or languages spoken by some women 
and many men were ignored as adventitious accomplishments. The three Christian 
groups belong to the Eastern or Orthodox church. The vast majority of the 
Mohammedans are Sunnis, only half the small group of Greek-speaking Moham¬ 
medans and a very few Turks being Shiahs (of the Bektashi Ordor of dervishes). 
The Greek-speaking Mohammedans and the Mohammedan Bulgars are respectively 
Greeks f and Bulgars J who abandoned Christianity for Mohammedanism at least 
two hundred years ago§. From the linguistic point of view the six groups are thus 
only four. 

The Turks descend from Asiatic Turks who came to Macedonia from Asia 
Minor a little before a.d. 1400. Some of these early Turks were cavalry in the 
service of Sultan Murad I|| (r. 1360—1389), the Turkish conqueror of Macedonia, and 
of his son Sultan Bayezid If (r. 1389—1402), and others were colonists imported by 
Sultan Bayezid with their wives and children to keep down the conquered natives**. 

* Races of Europe, London, 1900, p. 422. 

t Wftce and Thompson, Nomads of the Balkans, London, 1914, pp. 29—30. 

I Colonel W. M. Leake, Travels in Northern Qreece, London, 1885, iti. p. 270.- 

§ The evidence will bo eet oat in detail in the book win''* 1 O'c -be:;,« iiu^ss to write on the folklore 
of the Greeks, Turks, and Albanians >n the WeiIoiu Balkans. 

II Chalcondyles, be Rebus Tureicis, u, p. 52 b and o—». Farther details will be found in the 
observer’s book. 

If Ibid. n. p. 62 b. ,, 


" Ibid . n. pp. 81 b, 58 *. 
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The Bulgars, coming from the Volga to the Danube and then sweeping south and 
west across the Balkan peninsula, made themselves masters of Macedonia about 
a.d. 850*. The Slavs with whom they fusedf had come there towards the close of 
the sixth century The Vlachs are not mentioned until just before A.D. 600§, but 
their Latin language shows that they are older ||. The Greeks are at least as old as 
the Vlachs. An Albanian strain of recent origin is traceable among the Greek- 
speaking Mohammedans IT, and possibly a Pecheneg (Russian) strain dating from 
A.D. 1091 among the Mohammedan Bulgars of Karajova (Moglena)**. 

Intermarriage between any two of the six groups is so rare in normal times as 
to be biologically negligibleff. The Christian women reputed to have been carried 
off to Turkish harems have not been seen in the harems by impartial witnesses and 
seem to have existed mainly in the imaginations of propagandists eager to inflame 
Christian Europe against the Turkish Government. The Christian Macedonians 
denied the existence of such women. In times of disturbance on the other hand, 
after an abortive or an actual rebellion or after a war of conquest, intermarriage 
may have occurred on a biologically important scale. For instance, all the Turkish 
cavalry of the early sultans can hardly have brought their wives with them like the 
colonists of Sultan BayezidJJ, and they may safely be presumed to have taken 
native women to wife. The same is probably true of the conquering Slavs and 
Bulgars. 

Since the Mohammedan Bulgars and the Greek-speaking Mohammedans are 
only islands in a surrounding sea of Christian Bulgars and Greeks, the six groups 
are to be regarded as only four from the geographical point of view. The Christian 
and Mohammedan members of the Greek and Bulgar groups may live side by side, 
even in the same village, but the four linguistic groups live each in its separate 
district. The Bulgars live in the Vardar valley and along the Graeco-Serbian 
frontier, that is to say, oil the eastern and northern fringes of the area investigated. 
The Turks occupy the central region, the fertile plateau that stretches from 
Sorovich past the large village of Kayalar to the towns of Kozani and Servia. The 
Greeks live on, or west of, the plateau of Anaselitza, which lies south-west of the 
Turkish area and is walled off from it by Mounts Sinatziko and Burunos. The 
Vlachs live in high-lying, sub-Alpine villages among the Pindus mountains, on the 
slopes of Mounts Sinatziko and Vermion,and on the Gumenje Balkan of Karajova, 
for they are shepherds and merchants in contradistinction to the others who are 
all agriculturists. 

The accompanying map portrays this distribution of the village population and 
attempts also to suggest the relative numbers of the different groups. As much of 

# G. Weigaud, Ethnographie von Makedonien , Leipzig, 1924, p. 16. 

+ Ibid . p. 16. i Ibid. p. 10. 

§ Wace and Thompson, p. 266. || Ibid . p. 272: of. Weigand, p. 11. 

IT The observer will give details in her future book. 

** Zonaras, Annales, xvm. p. 23. Weigand perhaps exaggerates the strength of this Pecheneg strain: 
of. his Aromunen (Leipzig, 1895), i. p. 250, and his Ethnographies p. 22, with his Ethnographies pp. 39, 66. 

tf The oase will be fully set out in the observer’s book. Ohaloondyles, n. p. 31 b. 

22—2 
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Macedonia is very mountainous or is infested with scrub, an erroneous idea of the 
population is given by maps that only shade or colour large tracts of country and 
do not mark each village distinctively. Two caveats must be entered, however. In 
the first place, the villages differ considerably in size in proportion to the fertility 
of the soil or, in the case of the Ylachs, the range of pasturage. No reliable statistics 
exist, but possible averages for the Vlach, Turkish, Mohammedan Bulgar, Greek- 
speaking Mohammedan, Christian Bulgfir, and Greek villages respectively may be 
given, though with some reserve, as 1800, 500, 350, 300, 280, and 270 souls. In 
the second place, the map only claims to represent the population as it was in 1923 
and the immediately preceding years, and it takes no note of such linguistic changes 
as Dr Weigand* and Messrs Waco and Thompson f witnessed, or of such political 
changes as the wholesale substitution in 1924 of Greek refugees from Asia Minor 
for the Mohammedans under the Convention appended to the Treaty of Lausanne 
for the Exchange of Populations between Greece and Turkey. 

The measurements were all made on villagers. The population of the towns, 
especially the Mohammedan population, was too mixed in origin to be biologically 
valuable. The Mohammedan Bulgars measured lived in the villages of Rudina, 
Polyan, Kosturyan, Kapinyan, Prabodicha, Subotsko, and Fushtan on the very 
fertile plain of Karajova (Moglena) in the North-East. The Christian Bulgars came 
from fifty-six scattered villages. To evade political difficulties most of their measure¬ 
ments were made, by the kindness of the Greek authorities, on young army recruits 
who were serving in Athens. The Turks came from the villages of Chukur Anbar, 
Shahinlar, Dedeler, Kalbujalar, Sarihanlar, Yenikeui, Ak Bunar, Koja Ahmedli, 
Kuchuk Ahmedli, Islamli, Sinekli, and Hasankeui, all of which lie immediately 
west of, or south of, Kozani. The Greek-speaking Mohammedans came from all the 
villages of their group. The Greeks measured came from thirty-seven different 
villages in the extreme West. The chief were Bogatsko near Hrupista, Yerania 
beside Shatista, Konstantziko and Zhupan in the Pindus, and Dovrunista south of 
Lapsista. The Ylachs measured came from the Pindus villages of Samarina, Smixi, 
Avdela, and Perivoli. 

In each of the Greek, Greek-speaking Mohammedan, Turkish, and Vlach groups 
two hundred heads were measured. One hundred Christian Bulgars and one hun¬ 
dred Mohammedan Bulgars were measured, but the records of forty-five of the 
latter were lost by an unfortunate accident. Only men were measured, because the 
thickness of the women's hair made the measurements of their heads unreliable. 
The men measured were between eighteen and fifty years of age, and generally 
between eighteen and thirty. 

The measurements made were stature (without shoes), cranial circumference 
(through the glabella), head (glabello-occipital) length, head breadth (greatest 
transverse breadth of the cranium), face (nasio-mental) height, face (bizygomatic) 
breadth, nose height (distance between nasion and subnasal point), and nose width 
(without pressure). The stature was measured with the sectional height standard 


Aromunen t i. p. 249. 


f Nomadt, pp. SO—81. 
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recommended by the Anthropological Institute of Great Britain. With the excep¬ 
tion of the nose width the measurements taken were made with moderate pressure. 
The colouring was recorded by eye only and not by comparison with tinted wools 
or other substances. The adjectives employed explain themselves with the exception 
of “ medium ” as applied to eyes. The indeterminate shade usually called hazel in 
England is meant. The moustache was usually lighter than the hair and was often 
bleached. 

II. Reduction of M. M. Hasluck's Material , by G. M. Morant . 

Table I gives the mean measurements of the six groups of Macedonian men. 
They are arranged in order of the cephalic indices and it will be seen that there is 
no other measurement which furnishes the same order. Not only is that so, but 
there is actually no single pair of measurements which arrange the types in the 
same way. Such a state of affairs might still be found if the means for the total 
populations were available, but the apparent lack of any high inter-racial correla¬ 
tions between the measurements is probably due in part to the fact that statistically 
small samples are being dealt with. It is clear that no reliable conclusions can be 
deduced from the comparison of single characters. 


TABLE I. 

Mean Measurements of Groups of Macedonian Men. 



Turks 

Greeks 

Greek-speaking 

Mohammedans 

Christian 

Bulgars 

Vlachs 

Mohammedan 

Bulgars 

No. of Individuals 

200 

200 

200 

100 


55 

Stature 

1079*2 ±2*91 

1672-0 +2 

*76 

1675*9 ±S 

*62 

1679*12 +3 

•81 

1686*5 +3 

*04 

1668*5 14 

•78 

Head Longth... 

180*93 ± 

*31 

183-29 + 

*30 

181 *67 ± 

*29 

183-24 + 

•46 

187-96± 

*34 

186-78+ 

•64 

Hoad Breadth 

157*65 + 

*27 

157*33 ± 

*27 

153*57 ± 

*27 

152*10+ 

•36 

155-81 + 

•28 

150*241 

•65 

Horiisontal ) 
Circumference) 

543-79 ± 

*69 

516-30 + 

*70 

538*11 + 

•69 

539*93 ± 

*93 

550*17± 

•79 

532*91 + 1 

oo 

04 

Facial Height 

124-22 ± 

*31 

121*28 + 

*36 

121*84± 

*31 

117*14 + 

*42 

119-66± 

•30 

120*791 

•47 

Facial Breadth 

142*28 ± 

*25 

140*68 ± 

*26 

135-21 ± 

•30 

137*08± 

*33 

137*48± 

*34 

136*481 

•48 

Nasal Height... 

53*95 ± 

*22 

53*26 ± 

*21 

53*06 ± 

•20 

50*85 + 

*25 

52*90 + 

*20 

52*24+ 

•36 

Nasal Breadth 

35*65 + 

*15 

35-79± 

.17 

34*83 + 

•14 

33*25 ± 

•19 

35-26 + 

*17 

36*291 

*35 

Cephalic Index 

87-20+ 

*17 

85*92 + 

*19 

84*64 + 

•17 

83*26 + 

•27 

82*98 + 

*17 

80-53 ± 

•37 

Facial Index ... 

87*40 ± 

*23 

86*20 ± 

*27 

90*31 ± 

•27 

85*47 ± 

*29 

87*31 + 

•25 

88-47± 

*47 

Nasal Index ... 

67*20 ± 

*39 

68*39 ± 

•41 

67*05 + 

•41 

65*80 + 

*40 

68*00 + 

*39 

69-95 ± 

•91 


Standard deviations for all 11, and coefficients of variation for the 8 absolute 
measurements,are given in Tabic II. A number of significant differences in variability 
may be noted, but the differences between the extreme standard deviations only 
exceed 3*5 times their probable errors in the case of the facial height and breadth, 
the nasal height and the facial and nasal indices. All the measurements of the face 
show more significant differences in variability than do the measurements of the brain- 
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TABLE II. 

Constants of Variation for Series of Macedonian Men. 



Christian 

BulgarB 

Turks 

Greek-speaking 

Mohammedans 

Mohammedan 

Bulgars 

Greeks 

Vlachs 

Mean 

Standard 

Deviations 

(from 

weighted <r 9 ) 

No. of Individuals 

100 

200 

200 

55 

200 

200 


Standard Deviations 

Staturt;. 

56-08 ± a 

*10 

CO-98 + 2 

06 

55*02 + 1 

•86 

52*58 + 3 

•38 

57-88 + 1 

*95 

63*76 ±2 

•15 

58*86 

Hoad Length ... 

6*88 + 

•33 

6*60 + 

22 

6*16-4- 

•21 

7*00+ 

*45 

6*38 + 

22 

7*03 + 

•24 

6*61 

Head Breadth ... 

5*38 + 

■26 

5-65 + 

19 

5'76 + 

*19 

7-14 + 

•46 

5*67 ± 

*19 

5*94 ± 

•20 

6*81 

Horizontal ) 
Circumference f 

13*82 ± 

•66 

14*45 ± 

49 

14*54 + 

•49 

14*12 + 

•91 

14*78 + 

•50 

16*50 + 

•56 

14-01 

Facial Height ... 

6-10 + 

•20 

6*504- 

22 

6*51 + 

•22 

5-15 + 

•33 

7*52 + 

*25 

6*32+ 

•21 

6*61 

Facial Breadth ... 

4*85 + 

•23 

5*214* 

18 

6-19± 

•21 

5 "28 + 

•34 

5*51 + 

•19 

7*05 ± 

•24 

5-88 

Nasal Height ... 

3-09 + 

•18 

4*60 + 

16 

4*23 + 

•14 

3*D4 + 

•25 

4-50+ 

•15 

4*15 + 

•14 

4*28 

N;isal Breadth ... 

2*87 + 

•14 

3*08 + 

10 

3-01 ± 

■10 

3-80+ 

•24 

3-56 + 

•12 

3-57 ± 

•12 

3*30 

Cephalic Index ... 

4*01 + 

•10 

3*50+ 

12 

3-G0 + 

•12 

4*05 ± 

•26 

3*97 ± 

•13 

3*55 + 

•12 

3*73 

Facial Index ... 

4*32 ± 

•21 

4-72 + 

16 

5*60 + 

•19 

5*18 + 

•33 

5-67 ± 

•19 

5*32 + 

•18 

5*23 

Nasal Index 

5*88 + 

•28 

8-11 + 

27 

8-56 ± 

•29 

10-03 + 

•65 

8*66 + 

•29 

8*21 + 

28 

8*27 

Mean of ir 2 /Moan <r 2 

‘830 

•047 

•976 

1-057 

1*062 

1-087 

— 


Coefficients of Variation 


Stature . 

3-39 ± 

•16 

3*03+* 

12 

3-28 + 

•11 

3*15 + 

■20 

3-46+ 

•12 

3*78± 

•13 

_ 

Hoad Length ... 

3*75-4- 

•18 

3-65 + * 

12 

3*39 + 

•11 

3*75 + 

•24 

3*48 ± 

•12 

3*74 + 

•13 

.... 

Head Breadth ... 

3*64 + 

•17 

3-58 + - 

12 

3*75 + 

•13 

4*75 + 

•31 

3*60 + 

*12 

3*81 + 

•13 

— 

Horizontal | 
Circumference} j 

8-60 ± 

•12 

2*60 + 

•09 

2-70± 

•09 

2-05 + 

•17 

2*71 ± 

•09 

3-00+ 

•10 

- 

Facial Height ... 

5-264- 

•25 

5*31 + 

•18 

5-34 + 

•18 

4-26 + 

*27 

6*20 + 

•21 

5*38 + 

•18 

— 

Facial Breadth ... 

3*54 + 

•17 

3*66 + 

•12 

4*58 + 

•15 

3*87 + 

*25 

3*92 ± 

•13 

6-13 ± 

•17 

— 

Nasal Height ... 

7-26 ± 

•35 

8’53 + 

•20 

7*97 + 

•27 

7’54± 

•49 

8*45 ± 

*29 

7’84 + 

•27 

— 

Nasal Breadth ... 

8-63 + 

•41 

8*64 + 

•29 

8*64 + 

•29 

10*47 ± 

•68 

9*95 ± 

•34 

10*12± 

•34 

— 


box. The Christian Bulgars have the lowest constants for head breadth, horizontal 
circumference, facial and nasal breadths, nasal height and facial and nasal indices. 
The Vlachs arc more variable than the other series for stature, horizontal circum¬ 
ference and facial breadth. The standard deviations in the right-hand column are 
average ones found by weighting the squared standard deviations of the series with 
the number of individuals they contain. A measure of the relative variability of 
the series based on all the characters was obtained by averaging for the 11 measure¬ 
ments the squared serial (t’s divided by these mean as squared. The Christian 
Bulgars are found to be appreciably less variable than any other population 
represented. 
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The coefficients of racial likeness between the six groups are given iii Table III* 
All the available 11 characters have been used for this purpose and it is known that 
the intra-racial correlations between some pairs are greater than 0*5. The theoretical 
condition that the measurements used should be uncorrelated, or, at any rate, lowly 
correlated with one another, is thus not fulfilled, but if a selection were made the 
number of characters would be too small. As the series contain different numbers 
of individuals, the coefficients were reduced to the values they would have if each 
sample in the comparison consisted of 100. Direct comparison may be made between 
these adjusted values. The populations dealt with live in adjoining regions and in 
close contact, but they can be distinguished easily. In spite of differences in language 
and religion, the connection between the Greeks and Turks is the only intimate 
one. The inter-relationships of the types suggested by the coefficients can be seen 
more readily in Fig. 1. The Turks and Mohammedan Bulgars are most widely 
separated, but they are connected through the other four populations. The lack of 
any close connection between the Christian and Mohammedan Bulgars is surprising 


TABLE III. 

Coefficients of Racial Likeness between Series of Macedonian Men\. 



Turks 

(200) 

K 

1 

Greek -speaking 
Mohammedans 
(200) 

Christian 

Bulgars 

(100) 

Vlachs 

(200) 

Mohammedan 

Bulgars 

(55) 

Crude Coefficients 

Turks . 

■ 


5-39 

28-85 

32*57 

35-75 

29*92 

Greeks . 

(200) 

5*39 

— 

22-20 

19*29 

14-86 

21*21 

Groek-si>eaking Mohammedans (2(H)) 

28*85 

22-20 

— 

12-67 

22-41 

10*95 

Christian Bulgars... 

(100) 

32*57 

19-29 

12-67 

— 

13-56 

11*97 

Ylachs . 

(200) 

35*75 

14-86 

22-41 

13-56 

— 

12*35 

Mohammedan Bulgars ... 

(55) 

29*92 

21-21 

10-95 

11-97 

12-35 



Coefficients roduced to nin^/(n } +?i 2 ) = 50 

Turks . 



2-69 

14*43 

24*43 

17*87 

34*68 

Greeks . 




11*10 

14*47 

7-43 

24*58 

Greek-s]>eaking Mohammedans 


14*43 

11*10 

— 

9*50 

11-21 

12*69 

Christian Bulgars. 


24*43 

14-47 

9*50 

— 

10*17 

16*87 

Vlachs . 


17-87 

7-43 

11*21 

10-17 

— 

14*31 

Mohammedan Bulgars ... 


34-08 

24*58 

12-09 

16-87 

14*31 

— 


* With the usual notation the form of the coefficient used was: 


/I n,ri t 
\Mn t \ n't 


x K-<) 8 \ 
*«■ / 


1± *67449 



The average standard deviations given in the right-hand column of Table II were used for this 
purpose, 

f All the coefficients are based on the 11 characters given in Table I. The probable errors are £-27. 
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and it will be seen that the Turks resemble the Christian Greeks veiy much more 
closely than they do the Greek-speaking Mohammedans. Several very distinct 
racial types are evidently represented in Macedonia and more abundant material 
from that region, and from neighbouring districts, would be needed to unravel their 
blood relationships. The possession of a common religion or of a common language 
gives no indication whatever of descent. The coefficients of racial likeness have 
been calculated between a number of cranial series from South-Eastern Europe *. 
The values reduced to samples of 100 each are given in Table IV for 6 of these. 

TABLE IV. 


Coefficients of Racial Likeness between Male Cranial Series from South-Eastern Europe 

reduced to = 50. 
n j + n 2 



Slovene* 

(59'6)t 

Rumanians j 
(40-0) 

Turks 

(67*0) 

Oreek* 

(89-7) 

Serbo-Croats 

(79*8) 

Magyars 

(Mediaeval) 

(27*6) 

Slovenes 

O.R.L. 


3*244- 22 

5*22 ±*22 

8*46 ±'21 

8*20±*21 

16*51 ±*20 

(59'6)t 

No. of Characters 

— 

17 

17 

20 

20 

21 

Rumanians 

C.R.L. 

3*21 + *22 

_ 

8*88±*18 

6*17 ±'19 

8*10± *19 

9-88±-21 

(400; 

No. of Characters 

17 

— 

27 

24 

24 

19 

Turks 

C.R.L. 

5*22+-22 

8*88+*18 


3*10± *19 

6*26+ *19 

5-21 ±*21 

(67-0) 

No. of Characters 

17 

27 

— 

24 

24 

19 

Greeks 

C.R.L. 

8*46 ±*21 

e-i7±-i9 

3i0± *19 

_ 

7*27+ *18 

7*46 ±*20 

(89-7) . 

No. of Characters 

20 

24 

24 

— 

27 

22 

Serbo-Croat* 

C.R.L. 

8-20+-21 

8*10±*19 

6 *26 ±*19 

7*27+ *18 

_ 

8 *32 ±*20 

(798) 

No. of Characters 

20 

24 

24 

27 

— 

22 

Magyars (Mediaeval) 

C.R.L. 

10-51 + -20 

9-88+ -21 

6*21 ±*21 

7'46 ±*20 

8*32 ±*20 


(27*6) 

No. of Characters 

21 

19 

19 

22 

22 

— 


The Turkish skulls came from cemeteries in Constantinople. The Greek series was 
made up by pooling a group of 50 specimens from Europe and 45 from Asia Minor, 
the coefficient between the two samples being -0*04 ± *18. The groups of skulls 
were drawn from regions much further apart than any in Macedonia and greater 
racial differences would be anticipated. But actually the coefficients in Table IV 
are of a decidedly lower order than the ones for the living in Table III adjusted so 
that the sizes of the samples are the same in the two cases. The difference in the 

* G. M. Morant, “A Preliminary Classification of European Baoes based on Cranial Measurements,” 
Biometrika, Vol. xi® 1928, pp. 801—375. 

t The numbers in brackets are the mean numbers of skulls (n) available for the characters used in 
computing the coefficients. 












Values of a* reduced to _ = 50 between Series of Macedonian Men . 
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number and choice of characters used may partly account for this state of affairs, 
but the evidence still suggests forcibly that the coefficient of racial likeness between 
two racial types is of a much higher order for the living head than for the skull. 
This may mean that the skeleton is of more fundamental importance than the living 
body and that it is a more reliable guide to racial constitution. It may be noted 
that the living Vlachs, Greeks and Turks are related in the same way as the cranial 
forms of the Rumanians, Greeks and Turks. The Greeks occupy the intermediate 
position and their very close resemblance to the Turks is brought out by both 
comparisons. 

The significance of the differences between the various characters considered 
singly is conveniently measured by the as found in computing the coefficients. 
Values of a less than 10 may be taken to indicate that the types are undifferentiated 
by the particular character. The a’s reduced to values they would have for samples 
of 100 each are given in Table V # . As for the skull, there are found to be profound 
differences between the average contributions which the different characters make 
to the coefficients of racial likeness. Not a single significant difference is found 
between the mean statures of the six series. The nasal index is almost as constant 
and it only serves to distinguish the two Bulgarian types from one another. It is 
interesting to find these relations since so much importance has been attached to 
the stature and nasal index in attempting to classify European races: in the case 
of this small group they are perfectly worthless characters for the purpose. More 
significant differences are found between the other facial measurements, but the 
cephalic index and head breadth are of still greater importance. The last two, 
together with the head length, control the coefficients about as much as the other 
8 characters together. The following relations are observed from a comparison of 
Tables I and V: 

(a) The stature is constant for the six types. 

(b) The nasal index of the Mohammedan Bulgars is greater than that of the 
Christian Bulgars, but no other differences are found for this character. The 
Christian Bulgars are distinguished from all other types by their small nasal height 
and nasal breadth. 

(c) The only marked differences between the facial indices are occasioned by 
the high value for the Greek-speaking Mohammedans. The facial height is 
peculiarly large for the Turks and peculiarly small for the Christian Bulgars. The 
facial breadth makes a distinction between the Turks and Greeks on the one hand, 
with their high values, and the remaining four series on the other. 

(d) The small horizontal circumference of the Mohammedan Bulgars leads to 
the most significant differences for this character, but the Vlachs show an appreciably 
higher value than the Greek-speaking Mohammedans and Christian Bulgars. 

(e) For the cephalic index 10 of the total 15 comparisons are significant; for 
the head breadth there are 9 such and for the head length 8. The arrangement of 

* After the sizes of the samples have been adjusted it may not be true to say that an a greater than 
10 indicates a significant difference, but that limit may still be adopted conventionally. 
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the types given by these 3 characters is shown in Fig. 1 and it will be seen that it 
agrees well with one which would have been suggested by the coefficient of racial 
likeness. The only disagreement between the two methods is due to the absence 


Inter-relationships of Macedonian Types. 



of a low coefficient between the Christian and Mohammedan Bulgars. The very 
significant differences between the nasal measurements of the two clearly connotes 
a racial difference which is not revealed by the head length, head breadth and 





332 Measurements of Macedonian Mm 

cephalic index. Though the last is quite the most valuable single measurement, 
it is not able, by itself, to provide a reliable guide to racial relationships. 

The relative significance of the average differences between single measurements 
of racial types has been examined for the skull for 41 series from various parts of 
Europe as in Table V above. The 10 head measurements there correspond roughly 
to cranial measurements: the facial height of the head may be supposed homolo¬ 
gous to the upper facial height of the skull ( G'H) t the facial breadth to the bizygo¬ 
matic breadth (J) and the facial index to the upper facial index (100 G'H/QB ). 
The characters are arranged in Table Y in order of their mean as. The arrangement 
for the 10 corresponding cranial characters is: nasal index, nasal breadth, nasal 
height, upper facial height, upper facial index, horizontal circumference, facial 
breadth, skull length, skull breadth and cephalic index *. The two orders are closely 
similar, so the characters which are most constant and most variable inter-racially 
appear to be the same whether the group of racial types is represented by measure¬ 
ments of the living from a small area such as Macedonia, or by skull measurements 
from Europe as a whole. 

With regard to the qualitative characters, “Shape of Nose,” “Body Build/' 
“SkinTint,” “Hair Colour ” and “ Eye Colour,” the last two classified without standard 
scales, it was clear on tabulation that very little could be done with them. Such 
classifications are worth less than measured characters on the living, as the latter 
are worth less than measured characters on the skull. Tables VI—IX contain the 
reduced data exhibited as total frequencies and as percentages. Little of real racial 
value can be deduced from these tables as they stand. For example: taking 
curliness of hair, Turks and Christian Greeks stand closest, both are significantly 
different from Greek-speaking Mohammedans and Vlachs, but scarcely from 
Christian Bulgars. On the other hand, skin tint data do not separate Christian 
Greeks from Ylachs, but separate both from the groups of Turks, of Greek¬ 
speaking Mohammedans and of Christian and Mohammedan Bulgars, all of whom 
are indistinguishable in skin tint from each other. Skin tint must therefore be 
determined by a graduated scale. 

From skin tint and curliness of hair no definite result seemed to flow and no 
further regard was paid to these characters. The other characters have sufficient 
categories just to admit of our applying the method given in Biometrika , Vol. vm. 
pp. 250—254, to measure the probability that two independent distributions of fre¬ 
quency are really samples of the same population. If this probability be represented 
by P, Tables X—XIII give the values of this quantity. At first sight it seemed wholly 
impossible to draw any conclusions from these tables—they contradicted each other 
in such a serious manner. Finally we took the means for each pair of groups for 
the four qualitative characters, “ Shape of Nose,” “ Body Build,” “ Hair Colour” and 
“Eye Colour,”—thus endeavouring to get an average probability for each pair of 
races; and treated this os a measure of their racial relationship. From this pro¬ 
cedure Table XIV (p. 335) resulted. 

* Biometrika, Vol. xx». 1028, Table XVI facing p. 886. 
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TABLE VI. Shape of Nose . 



200 Greeks 

Frequency 

Percentage 

18 

9*0 ± 1*4 

200 Greek-speaking 
Mohammedans 

Frequency 

Percentage 

19 

9*5±l-4 

200 Turks 

Frequency 

Percentage 

17 

8*5 +1-3 

200 Vlachs 

Frequency 

Percentage 

16 

8*0+ 1*3 

100 Christian Bulgars 

Frequency 

Percentage 

8 

8-0 ±1-8 

55 Mohammedan 
Bulgars 

Frequency 

Percentage 

1 

1-8 + 1-2 





Straight 

Tip-tilted 

165 

82-5+1*8 

14 

7*0 ±1-2 


157 

78-5+1-9 

177 

88-5 + 1-5 
165 

82-5 + 1-8 


6*5 ±1*2 3'0±0*8 


4 

2-0 ±0*7 


3-5±0-9 2-0±0-7 


52-0 + 3-4 40*0±3*3 




TABLE VII. Body Build and Skin Tint 



Body Build 

Skin 


Narrow 

Medium 

Broad 

Dark 

Fair 

Freckled 

200 Greeks 

Frequency 

Percentage 

60 

30*0+2*2 

77 

38-5 ±2-3 

63 

31-5±2-2 

160 

80*0 ±1*9 

40 

20*0 ±1*9 

3 

1*5 ±0*6 

200 Greek-speaking 
Mohammedans 

Frequency 

Percentage 

88 

44-0 ±2*4 

59 

29-5 ±2-2 

53 

26-5 ±2-1 

124 

62*0 ±2*3 

76 

38*0 ±2*3 

0 

200 Turks 

Frequency 

Percentage 

61 

30*5 + 2*2 

83 

41-5 ±2-3 

56 

28-0±2*l 

129 

64*5 + 2-3 

71 

35*5 ±2*3 

0 

200 Vlachs 

Frequency 

Percentage 

76 

38*0 ±2-3 

57 

28-5±2-2 

67 

33-5±2-3 

165 

82-5 ±1-8 

35 

17*5 ±1*8 

5 

2*5 ±0*7 

100 Christian Bulgars 

Frequency 

Percentage 

44 

44*0 + 3-3 

35 

35-0 ±3-2 

21 

21-0 + 2*7 

65 

65*0 ±3*2 

35 

35*0 ±3*2 

0 

55 Mohammedan 
Bulgars 

Frequency 

Percentage 

24 

43*6 ±4*5 

14 

25 *5 ±4-0 

17 

30*9±4*2 

34 

61 *8 ±4-4 

21 

38*2±4*4 

0 


TABLE VIII. Colour of Hair. 



Black 

Dark Brown 

Brown 

Bed 

Fair 

Curly 

200 Greeks 

Frequency 

Percentage 

28 

14*0 + 1*7 

134 

67*0 ±2*2 

30 

15*0± 1*7 

1 

0*5 ±0*3 

7 

3*5 ±0*9 

12 

8*0±1*1 

200 Greek-speaking 
Mohammedans 

Frequency 

Percentage 

18 

9*0± 1*4 

114 

57*0±2*4 

42 

21*0 ± 1*9 

1 

26 

13*0 ± 1*6 

23 

11*5 ±1*6 

E 

Frequency 

Percentage 

2 

1*0±0*5 

125 

62*5 ±2*3 

56 

28*0 ±2*1 


17 

8*5 ±1*3 

9 

4*5 ±1*0 


Frequency 

Percentage 

73 

36*5 ±2*3 

87 

43*5 ±2-4 

25 

12*5 ±1*6 

0 

15 

7*6 ±1*3 

40 

19*0 ±1*9 

100 Christian Bulgars 

Frequency 

Percentage 

8 

8*0 ±1*8 

47 

47*0±3’4 

31 

31*0±3*1 

2 

2*0 ±0*9 

12 

12*0 ±2*2 

8 

8*0±1*8 

; 55 Mohammedan 
^ . Bulgars , __ 

Frequency 

.Percentage 

8 

_.H’5±3:3 

* 26 

47*3 ±4*3 

18 

32*7 ±4*5 

m 

2 

3*6±1*7 

0 
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TABLE IX. Colour of Eyes , 



Dark 

Medium 

Light 

200 Greeks 

Frequency 

Percentage 

110 

55-0 ±2*4 

66 

26-0+2*1 

34 

17*0± 1*8 

200 Greek-speaking 
Mohammedans 

Frequency 

Percentage 

72 

36-0 ±2-3 

71 

35-5 ±2-3 

67 

28*5 + 2*2 

200 Turks 

Frequency 

Percentage 

91 

45-5 + 2-4 

66 

33-0 ±2*2 

43 

21 *5 ± 1*9 

200 Vlaohs 

Frequency 

Percentage 

95 

47*5 ±2-4 

59 

29*5 ±2*2 

46 

23-0 ±2-0 

100 Christian Bulgars 

Frequency 

Percentage 

61 

61*0 + 3-3 

14 

14-0 ±2-3 

25 

25-0 + 2-9 

55 Mohammedan 
Bulgars 

F requeticy 
Percentage 

19 

34*5 ±4*3 

10 

18*2 ±3-6 

26 

47*3 ±4*5 


TABLE X. Shape of Nose (5 Groups). Values of P. 



Greeks 

Greek-Bpeaking 

Mohammedans 

Turks 

Vlaohs 

Christian 

Bulgars 

Mohammedan 

Bulgars 

Greeks 

— 

•2767 

•0812 

*0981 

•0000 

•3464 

Greek-speaking 

Mohammedans 

•2767 

— 

•0110 

•5600 

■oooo 

•1172 

Turks 

•0812 

■0110 

— 

•0417 

•0000* 

•1663* 

Vlachs 

•0981 

•5600 

*0417 

— 

•0000 

•1606 

Christian Bulgars 

•0000 

*0000 

•0000* 

•0000 

— 

•0000+ 

Moh&mmedau Bulgars 

•3464 

*1172 

•1663* 

■1606 

•0000+ 



* For 4 groups. + For 3 groups. 


TABLE XI. Body Build (3 Groups). Values of P. 



Greeks 

Greek-speaking 

Mohammedans 

Turks 

Vlaohs 

Christian 

Bulgars 

Mohammedan 

Bulgars 

Greeks 

— 

•0144 

•7466 

•0826 

•0384 

•1079 

Greek-speaking 

Mohammedans 

*0144 

— 


•2886 

•4963 

•7844 

Turks 

*7466 

•0110 

— 

*0248 

•0673 

•0740 

Vlachs 

*0825 

•2886 

•0248 

— 

•0013 

*7745 

Christian Bulgars 

•0384 

•4963 

•0673 

•0013 

i 

•3053 

Mohammedan Bulgars 

*1079 

•7844 

•0740 

•7745 

•3053 

i 
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TABLE XII. Hair Colour (5 Groups). Values of P. 



Greeks 

Greek-speaking 

Mohammedans 

Turks 

Vlaohs 

Christian 

Bulgars 

Mohammedan 

Bulgars 

Greeks 

— 

•0000 

•0000 

•0000 

•0001 

•0216 , 

Greek-speaking 

Mohammedans 

•0000 

— 

•0002* 

•0000* 

•0882 

•0771 

Turks 

•0000 

•0002* 


•0000* 

•0015 

•0000 

Vlachs 

•0000 

*0000* 

•0000* 

..... i 

•0000 

•0002 

Christian Bulgars 

•000] 

•0882 

•0015 

•0000 

— 

•3746 

Mohammedan Bulgars 

•0216 

•0771 

*0000 

*0002 

•3746 



* For 4 groups. 


TABLE XIII. Eye Colour (8 Groups). Values of P. 



Greeks 

Greek-speaking 
Mohammedans 

Turks 

Vlachs 

Christian 

Bulgars 

Mohammedan 

Bulgars 

Greeks 


•0004 

•1645 

•2266 

*0169 

•0001 

Greek-speaking 

Mohammedans 

•0001 


•1162 

•0675 

■0000 

•0127 

Turks 

•1645 

•1162 

— 

•7718 

•0019 

•0006 

Vlachs 

•2266 

•0675 

•7718 

— 

•0110 

•0018 

Christian Bulgars 

•0160 

•(Km 

•0019 

•0110 


•0052 

Mohammedan Bulgars 

*0001 

•0127 

•0006 

•0018 

•0052 

~ 


TABLE XIV. 

Probabilities that each pair of Macedonian Groups might have been samples of the same Race . 



Greeks 

Greek-speaking 

Mohammedans 

Turks 

Vlachs 

Christian 

Bulgars 

Mohammedan 

Bulgars 

Greeks 

— 

•0729 

•2481 

•1018 

•0138 

•1190 

Greek-speaking 

Mohammedans 

•0729 


•0346 

•2140 

•1461 

•2478 

Turks 

•2481 

•0346 

— 

•2096 

•0177 

*0602 

Vlaohs 

•1018 


•2096 

— 

•0031 

•2343 

Christian Bulgars 

•0138 

•1461 

•0177 

•0031 

— 

•1713 

Mohammedan Bulgars 

•1190 


*0602 

•2343 

•1713 

— 
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Without laying much stress on this table, we may draw the following results 
from it: 

(i) The Greeks are most like the Turks. 

(ii) The Greek-speaking Mohammedans are most like the Mohammedan 
Bulgurs. 

(iii) The Turks are most like the Greeks. 

(iv) The Vlachs are most like the Mohammedan Bulgurs. 

(v) The Christian Bulgurs are most like the Mohammedan Bulgurs. 

(vi) The Mohammedan Bulgars are most like the Greek-speaking Moham¬ 
medans. 

The whole of these results would flow from the first series of crude coefficients 
in Table III on p. 327. The other values in Table XIV for order of resemblance 
have considerable correspondence with the same values in Table III. Thus these 
qualitative characters may be said generally to give such support as lies in them to 
the relationships deduced from the measured characters first discussed. 

In conclusion the observer begs to offer her grateful thanks to the successive 
royalist and republican governments of Greece for the facilities which they gave 
her in spite of the political and military difficulties of 1921—3. She thanks Pro¬ 
fessor R W. Reid, too, for his kindly help, and the statistician and she could scarcely 
have compiled this paper without the practical interest of Professor Karl Pearson, 
while they have also to thank Miss Ida McLearn for the preparation of the figure 
and map. 




Greek-speaking Mohammedan (Chotil). 
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Mohammedan Turk (Sofular), anoeetry probably intermarried with Chrietiane. 







Plate III 
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Christian Bulgar of stock type (Pateli). 







Plate IV 
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Christian Vlach of fair type (Samarina). 
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(1) Introductory. 

Suppose that we are considering the distribution of a single variable, x , and 
that the population sampled is divided into a groups such that in the rth group 
x is normally distributed with standard deviation a about a mean x r . In general 
x u $ 2 , ... x a are not equal, although in a special case they may be so. A sample of 
N is now drawn in which n x individuals are taken randomly from the first group, 
n 2 from the second, and so on, where 

wi + w«-f ... + n a = N .(1).. 

Estimates X Xi X 2 ,... X a are made from the sample of the true population group 
means x x , , ... x ay and 

u = .(2) 

is calculated. Then if the quantities X r have been obtained in a suitable manner, 
it can be shown that the distribution of u in repeated samples of N* follows the 
Type III law 

N-e-2 
- - 

f (u) du = constant xu 2 e du .(3), 

where c will depend upon the method of estimation of the X’a. For example, if X r 
is the mean of the n r values of x sampled from the rth population group 

(r= 1, 2, ...o), 

then c«a; or if in the population xi=x % =...=x a , and we take Xi=X i — ...^X a 
= mean of the N sample values of x, then c = 1. It will be noted that (3) gives 

Mean u = N - c, <r u = V2(iV-c). 

* That is to My, samples in which n r individuals are drawn randomly from the rth population 
group (r=l, 3,... a),remaining fixed. 
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Some Notes on Sampling Tests with Two Variables 

Dr R. A. Fisher has based a number of simple but important statistical tests 
on the equation (3), which he classes under the heading of “ Analysis of Variance* " 
The expression 

8 ( 8 {xrt- X r f\ /(V-c) = u<t*/(N - c) . (4) 

r-l (t=l )/ 

he describes as an estimate of the population variance, cr*, based upon AT —c degrees 
of freedom; its mean value in repeated samples is seen to be <r a . 

The expression u, containing as it does the population <r a , is not of much direct 
value if this quantity be unknown, but in a number of problems the appropriate 
criterion to use is the ratio 0^u'/u, where v! is a quantity similar to the u of (2) 
but based upon an independent estimate of cr 2 , such that 

jv-c'-2 , 

f(u) du = constant xu' 2 e du' .(5). 

6 is now independent of a 2 and, as Fisher has shown (also it may be easily proved 
from (3) and (5)), if u arid v! are uncorrelated, then in repeated sampling 0 is 
distributed according to the Type VI law 

N' - c' - 2 N h N f - c - c* 

f(0) d0 ~ constant x 0 2 (1 + 0) 2 d0 .(6). 

Here the constant term is independent of a 2 . In dealing with this distribution 
Fisher uses the transformation 



that is to say he takes z as half the difference of the natural logarithms of the 
two estimates of variance. He has given tables showing for different values of 
Wi = N — c and n 2 = N ' — c' the value of z t the chance of exceeding which is *05 and 
•01 f. It will be noticed that by writing J = 0/(1 + 0) } (6) may be transformed into 
the Type I distribution 

N'-c'-2 N -c-2 

f (f) rff = constant x f 2 (1 - f) 2 d?.(8), 

whose probability integral depends upon the Incomplete Beta Function. 

The various tests based upon the frequency law (6) depend upon the variables 
being normally distributed. As soon as non-normality is introduced the distribution 
of 0 will be modified in a direction varying with the particular test. Not only may 
(3) and (5) be no longer applicable, but u and u will in certain cases be correlated 
where previously they were independent. In the present paper it is proposed to 
examine how far deviations from normality are likely to affect one of Fisher's tests, 
that for the goodness of fit of regression curves. The experimental results used to 
illustrate this point will also help to throw some light on the distribution of the 
correlation coefficient in small samples from non-normal populations. 

* A description of these tests is given in a paper entitled “ On a Distribution yielding the Error* 
Functions of several well known Statistics,'’ read before the International Mathematical Congress at 
Toronto in 1924 but only recently published. The methods of application without the mathematical 
framework are given in Dr Fisher’s Statistical Methods for Research Workers , 1926 and 1928, pp. 178 et seq. 

t Table VI, Statistical Methods for Research Workers. This z must be distinguished from the original 
z of “Student’s” test which Fisher writes as t/^n. 
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(2) The Application of the Principle of Likelihood. 

In two recent papers an attempt has been made by Dr J. Neyman and the 
author to connect together the various tests that are applied in different sampling 
problems by deducing from a common basis the criterion appropriate in each case*. 
It will perhaps be of interest to illustrate the use of this method in a further 
instance. The problem of the goodness of fit of regression curves presents itself 
commonly in the following form. We have before us a sample, 2, and wish to know 
whether it is likely that this has been drawn from a population, II, for which the 
curve of regression of y on x, let us say, follows a law 

Y x = F(x; «i, a a , ... a c ) .(9), 

where F is of given form but the constants a are unspecified. That is to say, we 
are testing what has been described as a “composite hypothesis”; it would become 
a “simple hypothesis” only if the values of a were specified in advance. What is 
the appropriate criterion to use ? In the case where c = 1, and we suppose that in 
the population Y x is constant, should we take the correlation ratio rj Vx ? And when 
in the population Y x is supposed to lie on a sloping straight line (c = 2), should we 
consider tj — r or 7j 2 — r 2 or even (as one of Blakemans alternativesf) the ratio of 
rj to r ? The general problem in which the array distributions may be of any form 
would probably be extremely difficult to solve, at any rate for small samples, but 
the solution in one important case—that in which the arrays of y for constant x 
are homoscedastic normal curves—can be obtained. And here the principle of 
likelihood appears to provide a method of finding the appropriate criterion. 

2 is a sample of N in the form of a correlation table, for which the marginal 
totals, the means, and the standard deviations in the a y-arraya arc respectively 
n xy y x> and s x (x = 1 , 2, ... a). The set flj of all possible populations from which 2 
may have been drawn is that in which the y-arrays are homoscedastic normal curves, 
but the regression of y on x as well as the distribution of the ^-arrays and of both 
marginal distributions may be of any form whatever§. The sub-set o> of H contains 
the populations for which the regression of y on x is given by the law (9). As a 
step in measuring the probability that 2 is a sample from a member of o> we shall 
find the likelihood of this composite hypothesis. Let II be a member of f1 for 
which the standard deviation in the y-arrays is <r, and the proportions in the 
marginal totals of these arrays are Pi, p*> - --Pa, where of course 

JPiH-i>a+...+Pa«l.(19). 

* “On the Use and Interpretation of Certain Test Criteria for Purposes of Statistical Inference,’* 
Iiiometrika , Vol. xx A . pp. 175—240 and 264—294. 

f Iiiometrika , Vol. iv. pp. 382—850. 

t This terminology was explained in Biomctnka, Vol. xx A . pp. 268—265. 

§ There is no need for the 2 -variate to be continuous; in faot, if it be, the distribution in the ^-arrays 
is only likely to be normal if the number of arrays, a, be fairly large. If, for example, dx be the breadth 
of a i/-array and we are dealing with a bivariate normal surface, then the array distributions wiU only 
be striotly normal in the limit as dx tends to zero. 


23—2 
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Then the chance of drawing from II a sample 

(1) in which n x individuals come from the xth array (#=» 1 , 2 ,... a), 

(2) where within the array the observations lie between the limits — 
and y tx + (< = 1, 2,... n», x — 1,2, ... o), 

will be in the limit as h-*-0 asymptotic to 

.< u >- 

Taking logarithms we find 

log G = constant + S (n x log p x ) — N log a - ~ ] 8 (n x 8 x * + n x (y K - T*) 1 )^ / 
x=i ^ U»i )/ 

where the constant term is a function of h and the sample frequencies only. To 

determine II (fl max) we maximise log C with regard to the variables a, Y x and 

p a .(# = 1,2,... a). The result gives 

8 ( n u s*)/N } 

X-\ I /lov 


Y m — y x > (*=1, 2,...a) 

p x =n x IN,(x= 1,2,... a) 


and hence as h 0, 




(\/ 2 t r) N 


To determine II(a>max) we maximise log G with regard to <r; cti, cr 2 , — ct c ; 
Pit Pi> • •• p a ; where the as are the c undetermined parameters contained in the 
expression (9). The solution is now 

<7* * 8 {n x s* + K* (y x - Y x )*}/N .(14), 

x-l 

p x = n m /N, (x = 1, 2,... a) .(15), 

and for Y x we have the values obtained by the solution of the equation 


^{H.(yx-F e )^} = 0, (<>■= 1, 2,... c) .(16), 


which are the same as those found by minimising 8 {n x (y x — l 7 *) 1 }, or from fitting 

x-l 

(9) by least squares to the observations. Inserting these values into (11) we obtain 
an expression for C(a> max) identical with (13) except that the sample array means 
and standard deviations occur in a term of form 

g \ n*8x+n x (y x -Y x Y \^ 

It follows that the likelihood of the composite hypothesis becomes 


(7(a) max) 
(7(fl max) 


Sn 9 (y x -Y u ? -2 


8 (n x 6*) 
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the values of Y„ being obtained by fitting (9) to the weighted sample array means 
by the method of least sqm res. Following a common notation, if be the standard 
deviation in the y-margin of the sample, we may write 


8 («.*„*)-i\T(l 

8 {n x (y x -Y lc )'}=N(r lyx *-R>)8S 




so that 


. «•> 


The hypothesis to be tested is most likely to be true when R»y Vx and X® 1, 
and becomes more and more improbable as X decreases. The completion of the 
solution depends upon finding the distribution of X in sampling from a member 
of w. This has been done by R. A. Fisher and is a special case of his general 
distribution ( 6 ) given above (p. 338). The quantity whose distribution he has 
obtained is not X, but a function of X which we may call 0, defined by the relation 

\=(i+oy iN , 


or 


_ t p* 8 K(y«-W 

a Vvx " _ *«i 

! - *,> ~ SL 


.( 20 ). 


8 (n*«, 2 ) 
*=1 


As X varies from 1 to 0, 0 varies from 0 to oo. If we divide the denominator 
of the expression for 0 by N—a> it will be seen that it becomes the ratio of the 
weighted sum of the squares of the deviations of the sample array means from the 
fitted regression curve to a weighted estimate of the population array variance. 
Without therefore introducing the idea of likelihood, 0 appears to be a natural 
criterion to use in judging the deviation of the observed regression from expected 
type. 


(3) The Sampling Distribution of 0. 

The proof has been given by Fisher in somewhat condensed form *. It may be 
divided into the following steps: 

(a) The set of all possible samples, T, from a population Ft can be divided into 
a number of sub-sets within any one of which, say 7 , the totals of the y-arrays have 
a fixed series of values n lt w*,... n a . The chance of drawing a given sample 2 from 
II can then be represented by the product of (1) the chance that 2 belongs to 7 , 
Nl 

or —r-;(pi)^-..(Pa) na , “d ( 2 ) the chance of obtaining the observed 

7lj I . •. fl a I 

value of the variates on drawing a random sample of % from the first population 
array, w* from the second, and so on. The solution is simplified immensely by first 
obtaining the distribution of 0 among the samples of a single sub-set. 

* Journal of the Royal StatUtical Society , Vol, lxxxv. pp. 697—611. 
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(b) Take k = S \n a {y x - F*)«j/<r» .(21). 

X=l 

Then within the samples of the sub-set 7 , if y x be a true population array mean, 
^.(y.-fc) is a quantity normally distributed about zero with standard deviation 
a. The sum contains the squares of a such quantities. The effect of using Y x , 
found by fitting a regression curve to the observations, instead of y xy can be shown, 
at any rate in certain important cases, to give for the distribution of k a curve of 
the form of (3), where N — a , the number of arrays, and c is the number of constants 


in the fitted regression curve (9)*. That is to say we have 

1 zlzl k 

f ( k) dk = constant x k 2 e dk .( 22 ). 

(c) Take ?= 8 \ S (y#t-y x f [A 2 * 5 s (^a 2 )/ 0-2 .( 23 )- 

x -1 U=1 J / sr-1 


This is the special case arising from equations (2) and (3) referred to on p. 337 above, 
where X r is the mean of the group of n r observations. The distribution of q is of 
form ( 3 ); there are N observations and the number of groups is a, hence 

N-a -2 

f ( q ) dq = constant x q 2 e dq .(24). 

(i d ) Finally within 7 , as the population y-arrays are normal, k and q are inde¬ 
pendent, the first depending only on the variation in means, the second on that of 
standard deviations. It is therefore easy to obtain from ( 22 ) and (24) the distribu¬ 
tion of 0 = kjq> namely, 

a~c-2 N-c 

f (0) d0 = constant x 0 2 (I -f 0) 2 d0 .(25). 

This distribution is not only independent of a but also of the array totals 
»i, ... n a . It will therefore hold within all sub-sets 7 , and hence for the aggregate 

of all possible samples T. The probability integral of (25) provides in fact the 
means of testing the hypothesis regarding the form of the regression curve. Various 
methods of obtaining this probability integral will be considered below. 

(4) The Effect of Non-normality. 

I)r Fishers test can be used in examining the goodness of fit of linear and 
non-linear regression curves, but it has involved two large assumptions, first that 
the distributions in the y-arrays are normal, and next that they have the same 
standard deviations. In cases of non-linear regression it is often found that the 
array standard deviations change, while the form of the curve may pass from 
symmetry through increasing degrees of skewness, With linear regression the 
assumptions are more likely to be justified. As the population diverges from 
normal form the test will become less and less efficient, partly because the criterion 

* This result appears to be exact provided that the constants a ,, a. 2 , ...a e appear in (9) in linear 
form; for example, if the curve be a parabola, or even a hyperbola of type Y x =~ +.. .4- . In the 
more general case it may perhaps be only true as an approximation. 
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6 is no longer the most appropriate one to use, and partly because its sampling 
distribution will cease to conform to (25), but it would be almost impossible to say 
at what point it becomes invalid. The practical situation seems, however, to be this; 
the statistician who is not dealing with very large samples has often no means of 
judging the exact form of his population distribution. It is therefore of first 
importance that he should feel some confidence that moderate deviations from 
normal homoscedasticity will not make worthless any conclusions which he may 
draw by referring z to Fishers tables or 6**kjq to the distribution (25). The 
problem is a large one, seeing in how many directions non-normality may arise, but 
a simple illustration will throw some light upon it. 

Suppose that the distributions in the y-arrays of the population are homoscedastic 
non-normal curves with the frequency constants <r, and If the means of the 
population arrays, y Xi were known, we could calculate 

S N(y,-y,)*l/<r*= 8 (v 2 )/* 2 .(26). 

x=»l x~\ 

Within the sub-set of samples, 7 , defined on p. 341 above, v x will vary about zero 
with standard deviation <t and with a second “beta coefficient” defined by 

x B 2 = 3 + ($2 — 3)/ n x . 

It follows that in repeated sampling within 7 , 

Mean k' = a, 

Mean (k') 2 = j S (Mean v x 4 ) + 28 '(Mean fl ® 2 V*)J jo** 

— 8 ( x B 2 ) + 2 S' (Mean v 2 x Mean v*}/o - 4 

x^\ 

(since within 7 , v x and v x > are uncorrelated) 

= S CA) + a(a-l) = a a + 2 a + (&-3) § (~). 

X-l X-\ \"xJ 

Hence o* a = Mean (k'f - (Mean kj = 2 a + (& - 3) 8 (-) .(27). 

It is seen that the mean value of k' is the same whatever form be the population 
array, but (27) shows that the variability of k ' depends upon /3 2 , and further is not 
the same within each of the sub-sets 7 , varying according to the marginal totals n x . 
The second term of (27) will usually, however, be very small compared with the first, 
and we may conclude that unless the population arrays are extremely leptokurtic 
or the sample very small, the distribution of k' will not differ seriously from that 
of “normal theory.” The quantity with which we are really concerned is the k of 
( 21 ), obtained by using the F^s of the fitted regression curve; its variability would 
appear harder to determine, but it seems likely that just as for k f the equation ( 22 ) 
will represent its sampling distribution with fair accuracy provided the sample is 
not very small or the array f3 2 large. 

* S' implies the summation for all possible pairs out of the a-arrays. 
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We may now consider the modifications connected with the q of (23). Within 
the sub-set y, we know that 
. (Mean («**) = (»« -1) 

(Mean (a*) = (n«-1) {(«,+ (n* - 2n« + 8)} 

Hence Meang = S (n*Mean= S (n B —o, 

x=\ x=l 

Mean q*= S {«.** Mean a*}/a* + 2 S' {n K n^ Mean s x 2 x Mean 

£C= 1 

as within 7 , s x 2 and s^ are uncorrelated. Substituting the values for Mean 8 X 2 and 
Mean s x 4 it is found after reduction that 

Mean </*= (JV- a) 8 + 2 (W - a) + (ft - 3) jiV- 2a + 5 , 

» . 

Again the mean value of q is independent of the population array form, but a q 
differs from the “normal theory” value of V2 (N—a). Although the third term 
within the brackets in (28) may be small, the second term will often not be 
negligible compared to unity. For example, if # 2 = 4, and we are dealing with very 
large samples, this second term will be of the order of 0*5. We must conclude 
therefore that if the population array distributions are distinctly platykurtic or 
leptokurtic, the variability of q will be affected and the “normal theory” law (24) 
begin to fail, although still giving the correct mean for q . The denominator of the 
ratio 0 = k/q is in fact more sensitive to changes in population form than the 
numerator. If the array curves are skew, another feature is introduced owing to 
the correlation between deviations in mean and variance; that is to say y x and 8 X 
will be correlated. This will lead to a correlation between k and q which, provided 
that it is positive*]*, should have in the ratio 6 somewhat the same compensating 
effect as in “Student’s” ratio z when the population is not normalJ. 

(5) Sampling Experiments. 

To illustrate further the result of non-normality in this and certain other problems 
two series of sampling experiments have been carried out. The first, in which the 
arrays were both normal and homoscedastic, does no more than confirm the un¬ 
questioned accuracy of “ normal theory” as set out in equations (22), (24) and (25), 
but it will be of more value in connection with the distribution of r. In the second 
experiment the standard deviation in the arrays was varied and the distributions 
were taken to be Type III curves. 

* This value for the mean of the square of the variance is taken from Dr Church's paper in 
Riometrika , Vol. xvn. p. 81. 

t If the array distributions are leptokurtic this correlation will presumably be positive. If, however, 
they were for instance “ rectangular,” large deviations in y x would be associated with low values of 
leading to a negative correlation between h and q which would tend to increase the variability of 8, 

X See Biometrika , Vol. xxi. p. 259 et »eq. 
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Experiment I. 

The population contained three arrays (a = 3) with proportions pi *= *40, p,—*85, 
Pt =■ *25. The three array distributions were normal and homoscedastic, and the 
regression of y on x was linear, the coefficient of correlation being p «■ ‘5346. The 
sampling was carried out with the help of Tippett’s Random Numbers*, the 
grouping unit for y being £ of the array standard deviation. 200 random samples 
of 20 were taken and k, q and 0, as defined above, calculated in each case. In fitting 
a sloping regression straight line to each sample we are using a law (9) of form 


F* = «i + a t x .(29), 

that is to say c = 2, while N = 20, a = 3. 

Distribution of k. 

Equation (22) becomes 

f(k)dk = constant xk~l e~^ k dk .(30), 

which is the distribution of with n' =* 2. The following results were obtained: 

Mean k ; Theory 1*000, Observation 1*109, Standard Errorf 0*100. 

<r k ; „ 1*414, „ 1*538, „ 0*187. 

The Goodness of Fit Test, using 11 groups, gave P = *416. 


Distribution of q. 

Equation (24) becomes 

/(?) dq ** constant x q V e~4? dq .(31), 

or the distribution of with n — 18. The following results were obtained: 

Mean q ; Theory 17*000, Observation 17*035, Standard Error 0*412. 

<r,; „ 5*831, „ 6*586, „ 0*339. 

The Goodness of Fit Test, using 16 groups, gave P = *982. 


Distribution of 0 = k/q. 

Equation (25) becomes 

/ (6) dd — constant x 6 ~ * (1 + 0)~* dd .(32). 

Using the transformation f = 0/(1 + 6) we obtain the Type I distribution 

/(f) df= constant x f~4(l - f)* df.(33), 


whose probability integral depends on the Incomplete Beta Function, 
general distribution (25), 


Mean 6 = —- 

N —a — 


1 /2(a —c)(i\T—c —2) 

Ct ~N-a-2 V JV — a —4 


For the 
...(34). 


* Tracts for Computers , No. xv. 

t The standard errors are for Mean k a nd <r k o alenlated from 300 samples. The first is o k l,jN, and 
in the second ease the approximation jpr* J(Pi~ been used, where here N=*2 00 and fa refers 

to the theoretical distribution of k whioh for a x a distribution has a value of 8+12/(n'-l). Similar 
expressions are used for g and $, 
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Using those values with N = 20, a *= 3, c = 2, it was found that 

Mean 6\ Theory *0667, Observation *0727, Standard Error *0074 
<r„; *1046, „ *1074. 

The distribution of 6 is a J-curve, and the expression used above for the standard 
error of a standard deviation will hardly be satisfactory. Using the transformation 
to a Type I curve, and comparing theory and observation for 11 groups, the Good¬ 
ness of Fit Test gave P =* *409. 

Correlation between k and q. 

As we should expect, there is no evidence for such a correlation. The observed 
values for the 200 samples are 

r*--*0342, V=-l 227 - 

The standard error for r kq on “normal theory” is 1/V200 — 1 = *0709, while if we 
may consider the arrays of q in the k , (/-correlation table sufficiently nearly normal 
to apply the test we are now discussing and as described in section (6) below, then 
equations (44) give Mean rj 2 = -1156 and or n 2==*0319, so that the observed value of 
*1227 is not significant. 

Taken collectively these results show an admirable agreement between observa¬ 
tion and theory. 

Experiment II. 

The population contained five arrays (a = 5) with proportions and array standard 
deviations as follows: 

Array 1 2 3 4 5 

p x *1667 *2666 *2167 *1833 *1667 

<r x 5*7471 5*3191 5*0000 4*7619 4*5872 

The standard deviations are in terms of the grouping unit employed for the 
sampling. The regression was linear and the coefficient of correlation was 
p as + -4626. The distribution in each array followed a Type III curve with 
^ssO‘20, £ 2 * 3 * 30 . If x is taken to be increasing as we pass from Array 1 to 
Array 5, and p is taken as positive, then these curves were negatively skew, the 
steeper tail pointing in the direction of increasing y. 300 random samples of 30 
were now drawn, again using Tippett's Random Numbers. A sloping straight line 

Y x = «i + <* 2 # .(29 bis ) 

was fitted to each sample, so that N = 30, a = 5, c = 2. 

Distribution of k . 

The population array standard deviations vary, but the weighted mean of 
the variances, or a 2 = 26*1306, has been substituted for a 2 in the expression for 
k t (21), and also later in that for q f (23). Equation (22) becomes 

f{k) dk * constant x $ dJc 


(35). 
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The following results were obtained: 

Mean k\ Theory 3*000, Observation 2*842, Standard Error* 0*141. 

<r*; „ 2*449, „ 2*457, „ 0*173. 

Theoretical frequencies were obtained from the Tables of the Incomplete Gamma 
Function , taking p» 0*5 and testing for goodness of fit with 14 groups 

it was found that P = *329. A comparison of cumulative frequencies is given in 
Table I. 

Distribution of q. 

Equation (24) becomes 

/(?) ^2 3=1 constant x q** e ~ 1? dq .(36). 

The following results were obtained: 

Mean q ; Theory 25*000, Observation 25127, Standard Error 0*408. 
c q \ „ 7*071, „ 8*052, „ 0*321. 

Theoretical frequencies were again obtained from the Incomplete Gamma Function 
Tables taking p * 11*5, u = 5 /V50, and on testing for goodness of fit with 13 groups 
it was found that P = *474. 


TABLE L 

Frequency Distributions from Experiment II. 


Distribution of k 

Distribution of q 

Distribution of 250 

k greater 
than: 

Observa¬ 

tion 

Normal 

Theory 

q greater 
than: 

Observa¬ 

tion 

Normal 

Theory 

250 greater 
than: 

Observa¬ 

tion 

Normal 

Theory 

o-o 

300 


8 

300 

299*8 

0*0 

300 

300*0 

0*5 

272 

275-7 

10 

297 

299*0 

0*4 

281 

281*4 

1*0 

237 

240*4 

12 

293 

296*0 

0*8 

245 

254*2 

1*5 

192 

204-7 

14 

283 

288*5 

1*2 

219 

225*8 


158 

171*7 

16 

269 

274*4 

1*6 

194 

198*6 

2*5 

128 

142*6 

18 

244 

252-7 

iiK&M 

171 

173*6 

3*0 

111 

117*5 


216 

224*0 

■ 

141 

151*2 

3*5 

87 

96*2 

22 

183 

190*7 

2*8 

124 

131*3 

4*0 

66 

78*4 

24 

154 

155-8 

3*2 

97 

113*8 

4*5 

53 

63*7 

26 

122 

122-3 

3*6 

79 

98*6 

5*0 

42 

51-5 

i 28 

96 

92*4 

4*0 

73 

85*4 

5*5 

37 

41-6 

30 

68 

67*3 

4*4 

63 

73-9 

6*0 

32 

33*5 

32 

54 

47*4 

4*8 

57 



26 

21*6 

1 34 


32*4 

5*2 

47 

55*4 

8*0 

13 

13*8 

36 

29 

21*5 

5*6 

42 

48*0 

9*0 

6 

8*8 

38 

22 


6*0 

37 

41*6 

10*0 

5 

5*6 

40 

17 

8*8 

6*4 

31 

36*1 


4 

3*5 

42 

12 

5*4 


21 

27*2 




44 

10 

3*3 

8*0 

17 

20*6 




46 

7 

2*0 

8*8 

15 

16*6 




48 

2 

1*2 

9*6 

11 

11*9 







10*4 

8 

9*1 







11*2 

6 

7*0 







12-0 

4 

5*4 


The standard errors were calculated as for Experiment I, using N=300 (see footnote to p. 845). 
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Distribution of 0** k/q. 

Equation (25) becomes 

/( 0) d0 = constant x 0* (1 + 0)~ u d0 .(37). 

Using equations (34) we obtain the following comparison: 

Mean 0 ; Theory *1304, Observation '1195, Standard Error *0068. 

<r.; „ -1185, „ ‘1052. 

a e 2 ; „ 01404, „ *01107, „ ‘00238. 

The distribution of 0 is so skew that the approximation to the standard error of a 
standard deviation used above is of doubtful value. The variances have therefore 
been compared, and differ by 1*25 times the standard error*. A comparison of 
goodness of fit was obtained by calculating the mid-ordinates of the Type VI curve 
for 0 y (37), and correcting to obtain the group frequencies. Using 17 groups, a value 
of P =s *270 was obtained. The cumulative frequencies are compared in Table I. 


Fig. I. Correlation of q and k. 



Correlation between k and q. 

It was found that r* g = + *1604; the standard error for zero theoretical corre¬ 
lation, were k and q normally distributed, is *0578, so that rt q differs from zero by 
about 2*8 times this standard error. The means of q for constant k are plotted in 
Fig. 1, where the observed regression straight line of q or k, or 

§* = 23*6669 + *5278 k .(38), 

has also been drawn. 

* For BOO samples the standard error of the variance is very nearly *J(p 2 ~ where the /8, for 
distribution (87) is approximately 9*67. 
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If these results are taken as a whole it mil be seen that there is nowhere any 
marked difference between the observed distributions and those of * normal theory.” 
Owing to the changing array standard deviations the position is not as simple as 
that considered in section (4) above, but there seems to be evidence that the changes 
there contemplated are beginning to occur. We may note: 

(a) The distribution of & is in good agreement with theory. 

(b) The mean q differs from the expected value by only about one-third its 
standard error, but the observed <r q is significantly greater than the “normal theory” 
value. This is as we should expect from (28), the array fa being 3*3; the slight 
excess of large values of q can be seen in Table I. 

(c) The observed mean 0 and <t 9 are a little low, but hardly significantly so, 
and a comparison of the cumulative frequencies of 250 in Table I does not suggest 
that any serious error would be introduced by making use of the 0 distribution (25). 

(i d ) Finally a positive correlation between k and q has appeared which is 
probably significant. 

There are, of course, so many directions in which the population form may be 
modified and so many changes to be rung in the values of N, a, c and p that it 
would be dangerous to draw too sweeping conclusions from a single experiment. 
Yet, as far as it goes, this appears to be a satisfactory result, and it suggests that in 
cases where we believe that the deviations from normal homoscedasticity in the 
y-arrays are of about the order existing in this experimental population, Fishers 
test may be used with confidence. 


(6 ) The Practical Determination of the Probability Integral of f(0). 

Let us first restate the problem; it is that of testing the hypothesis that a 
given sample comes from a population in which the regression of y on a follows 
a curve 

Y x =f(x; a lf a 2 ,...ac) . (9 bis). 


We either know that the population y-arrays are normal homoscedastic curves, 
or are prepared to take the risk of assuming that the deviation from this form is not 
suflScient to invalidate the test. We fit the regression curve to the sample by least 
squares and calculate 


. ™ S My m -Y x )*\ 

A Vyx ^ a?«l _ 

* 1 » — a 

Vv * s M) 


,(20 bis). 


Since it is when 6 is large that the hypothesis is unlikely to be true, we refer this 
quantity to the distribution it would follow in repeated samples were the hypothesis 
true, namely, 

a-c-2 N-e 

f(0) d0 “ constant x 6 3 (1 + 6) 2 d0 


,(25 bis), 
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find 1\ — I f (0) dff, and on the basis of these odds judge whether or no so great a 
' 0 

value of 0 is likely to have arisen through chance fluctuations. The two simplest 
cases that arise are when : 

(a) c = 1, and we wish to test the hypothesis that the population array means 
are constant. In this case the fitted regression line Y x = cr x becomes Y x = y> the 
mean of the N individuals in the sample, while from the definition of (18) 72 = 0 
and 0 = rfj 1 — rf. 

(b) c = 2, and we wish to test the hypothesis that the regression curve is linear 
but not necessarily parallel to the axis of x . Hero Y x — a i + a 2 x is the ordinary 
regression straight line of y on x , and R — r x)J> the coefficient of correlation in the 
sample. Then 0 = {rf — r*)/(l — rf). 

We shall now discuss several methods of calculating P 9 , the chance of obtaining 
in random sampling a more divergent result than that observed. 


1 . 72. A. Fisher's Method. 

(25) is a special case of Fishers general distribution referred to in section (1), 
which he takes as 

_ 1h -f 7lo 

f{0) d0 = constant x 0 2 (1 + 0) 2 d0 .(39). 


Writing z = - log,. 0^j , it follows that the distribution of z* is 


e lx \ z dz 


f(z) dz = constant x ,—■=- -ri-,——. 

J v ; (tii # 2 +■ w 2 )i< n i +n a) 


.(40). 


Tables VI of his Statistical Methods for Research Workers give for different 
values of n x and n 2 the values of z corresponding to the *05 and *01 proportionate 
tail areas of the z curve. In the present problem n x = a — c, n 2 = N— a, and 

1 , (rf — 72 2 N-a) 

2 -2 1o *-| r-v 


a — c 


.(41). 


The Tables can be entered with integral values of n x from 1 to 6 , then for 8 , 12 , 
24 and oo; and of n 2 from 1 to 30, then for 60 and oo. For many purposes this is 
adequate, but greater refinement is sometimes required. 


2 . T. L. Woo's Tables . 

These have been published in the present volume of this Journal. They were 
primarily intended for testing the significance of a value of rf, i.c. for the case 
c = l. If we use the transformation f=0/(l -f 0), equation (25) becomes of the 


form of ( 8 ) or 

a-c-2 N- a -2 

/(£) d% — constant x f * (1 — 0 ® df.(42), 

2 _ D2 

where .< 43 >- 

Then if c = 1, £= v * and if c-2, £=(,*- r*)/(l - r*). 


* This z must be distinguished from ‘‘Student’s’* z. 
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Mr Woo has taken c = 1 , and his tables are entered with N and n, which is a of 
the present paper. They may, however, be used for any value of c by equating his 
N to our JV- c +1 and his n to our a — c +1. The tables give for a wide range of 
values of N and n* t (1) Mean £ (2) er*, and (3) the ratio (£— Mean £)/<r( corre¬ 
sponding to tail areas of about ‘02 and 01. 


3. Other Methods of Approximation . 

The Tables of the Incomplete Beta Function f which are nearing completion in 
the Biometric Laboratory, will give the probability integral of (42) for a certain 
range of values of N f a and c, but it seems of interest to describe a form of 
approximation adequate for moderately large samples based on the Type III curve 
and the Incomplete Gamma Function. For the Type I curve written in the form 

y = yoZ p ~ l (i-O q - 1 

we have the following moment constants: 


Mean== j , + (/ = l^ if> = £(a-c), q = b(N-a) 


Variance: * - 


pq 


_ 2(o-c) (N-a) 

(P + ?J*(jP + ? + l) (N-c) 2 (N-c + 2) 

= *(p-q)Hp+q± 1) _ 8 (V z 2a + c)* (N-j+2) 
pq(p + q + 2f (N’-a)(a — c)(N-c- f-4) 2 J 

Further, we know that 

6 ( 02 l)/(2& -3A - 0) « - (p + q\ 

and consequently 

2&-8&-6 —12(&-ft-l )KN-c) . 


.(44). 


.(45). 


The relation (45) suggests that if N be not too small the (0 Xi 0 t ) point of the 
curve of (42), will lie close to the Type III line. The extent to which this is so 
is illustrated in Fig. 2, which shows for c = 2 how for a constant number of 
arrays, a, the point converges on the Type III line as N increases. We shall there¬ 
fore examine the adequacy of the following approximation • to represent the Type I 
curve (42) by a Type III curve with its mean, variance and 0 X having the values of 
(44), or approximations to these values. 

The equation to the curve, whose integral I (w, p) is given in the Tables of the 
Incomplete Gamma Function j*, is 

y =* constant x u .(46), 


u s (deviation from start)/(standard deviation), 
where -j Vp + 1 x standard deviation = distance from start to mean, 

p = 4/&-1. 


* N=51 to 1000 and n=8 to 20. 
t His Majesty's Stationery Offioe, 1022. 
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Hence we must take 
— °L~P W -o — (a-c))(-AT — c + 4)* 


-1 


2 (JV — c — 2(a — c))*(iV— c + 2) 

,^{ 1 + 3J^ + }_i. 

where we have expanded in inverse powers of N — c. 


Fig H. Showing the fi x , fi 6 Points for the Distribution of £ in the case c*2. 

A 


0 0-2 0*4* 0-6 0*8 1*0 t*2 1*4* 4*6 4*8 8*0 



In the same way we may obtain an expansion for the expression for given in 
(44), namely, 

.<*>■ 

Hence combining (47) and (48) we have 

Distance from start to mean = Vp + T <r { 


a —c f- o — c + 2 ) 

=F^( 1 + -irrr + ‘”} 


.(49). 


N-c 
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The fitted Type III curve does not start exactly at £= 0; the position is repre¬ 
sented in Fig. 3. The Type III curve starts at A , the true curve at 0, the means 
coincide at M, and it is desired to approximate to the tail area under the true 
curve beyond P by taking the corresponding area under the Type III curve, 

AP=OP + AO = OP + AM-OM 


«t+ 


o-c 

N-c 



o-c + 2 ) a-c 

tr-e •"]■ N-c 


y . (0-c)(0-C + 2) 

(N-c)* 


using equations (44) and (49). The Tables of the Incomplete Gamma Function are 
now to be entered with the p of (47) and u = AP/cf or 


u 



(a- c) (a - c + 2 ) 

(N-ff_~_ 


V2(a- 



— c -f- 2 
'N-c 



(50). 


Fig. 11 



There are now possible two degrees of approximation. 


Take the p of (47) and the u of (50) as far as the terms given; this will involve 
interpolating for both p and u. 

Method II. 

Take p » $ (a - c), u = NZj’J2 (a — c), that is to say assume that a and c may 
be neglected compared with N. In this case it may only be necessary to interpolate 
for u*. 

If the Tables of the Incomplete Gamma Function are not available, use can be* 
made in certain cases of Elderton’s x* Tables in Tables for Statisticians and 
Biometricians. The v® distribution is 

n — -ix» 

y * constant x (x*) 2 e .(51). 

* In the Incomplete Oamma Function Tablet (1923) the argument interval tor p ie 0*1 up to 6-0, 
but beyond this it ia 0-2, e.g. there is a oolumn (or p=4'5, but lor 6-5 we must interpolate between 
jp = 5*4 and 6*6. 

Biometrika xxx 


24 
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This corresponds to (46) if wo write 

^ = 2 u Vp +1, n' = 2p + 3, 

and consequently wc have two approximations corresponding 


to Methods I and II. 


Method III. 


Enter Elderton’s Tables with 




+ 


(a - c) (a — c + 2 )' 
(N— c ) 4 


)• 


• a — c + 1 4- 


3 (a — c) (a — c + 2) 
N-c “ * 


It is here necessary to interpolate between the columns of ri, which is not easy to 
do accurately. 


Method IV. 

To a rougher approximation use 

X 2 = N%, v! = a — c + 1 . 

Here n f will have an integral value and it is only necessary to interpolate for ^ a . 


TABLE II. 

Values of P$. 



Number of 






array r 

a 

f=- 

s 1 - r‘ 

True I\ 

by i 

P ( by II 

100 

0 

•1141 

•0193 

•0190 

•0284 

( 

6 

■0018 

•0130 

•0130 

•0145 

2(H) | 

14 

*1120 

•0285 

*0277 

•0322 

14 

•1191 

•0170 ! 

•0173 

•0208 

l 

22 

•1057 

•0248 

•0230 

•0314 

{ 

14 

•0453 

•0292 

•0291 

•0307 

500 \ 

14 

•0482 

•0184 

•0183 

•0190 

{ 

22 

•0072 

•0200 

•0257 

•0285 

1000 

22 

•0338 

•0204 

•0203 

•0277 


By taking certain values from Mr Woo’s tables, it has been possible to examine 
the closeness of approximation of Methods I and II; except for the difficulty in 
accurate interpolation III and IV would give the same results as I and II respec¬ 
tively. Suppose that we take the case c « 2, or are testing whether the regression 
of y or x is linear, and that we found in the nine samples with values for N and a 
shown in Table II, the values of if* (-> 7 * — r*)/(l — r®) given in the 3rd column. 
Then the true values of P;* found by Mr Woo from the appropriate Type I distri¬ 
butions are set out in the 4th column, while those found by using the approximate 

* That is to say the ehance of f exceeding the observed value in random sampling were the hypothesis 
tested true. 
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Methods I and II are in the 5th and 6th columns. While not attempting to be 
mathematically exact, there can be little doubt that Method I gives values for P< 
accurate enough for most practical statistical work. As we should expect for a 
given N the error increases as the number of arrays is increased. For N below 100 
and a large number of arrays the approximation will no doubt become less satis¬ 
factory, but this field will be covered by the Tables of the Incomplete Beta Function . 
For large samples the gain in speed by using Method II may well be felt to 
compensate for the loss in accuracy. 

These results only provide a comparison at the level of significance P{ *■ 03 to 
*01. It seemed desirable to examine the degree of approximation throughout the 
whole range of the curve, and this has beeu done in three cases, namely, N *= 102, 
c = 2, a*8; JV* 202, c = 2, a = 14; iV=5 02 , c = 2 , a = 22 . The true probability 
integrals were found by quadrature of the curves 


TABLE III. 

Showing the Chance of Exceeding Certain Values of 



Af = 102, 

c = 2, a = 8 


A' = 202, 

c = 2, tt = 14 

AT=502, c=2, a =22 


, or chance of exceeding £ 


1 *£, or chance of exceeding f 


P^, or ohance of exceeding £ 


True 

value 

Method I 

Method 11 


True 

value 

Method I 

Method 11 

i* 

True 

value 

Method 1 

Method 11 

•ooo 

1-0000 


1 *0000 

*000 

1-0000 

1-0000 

1-0000 

•ooo 

1*0000 



•012 

•9780 

•9731 

•9940 

•024 

•9676 

•9642 

•9636 

•016 

•9926 

•9919 

•9918 

•030 

•7414 

•7362 

•8348 

•050 

•6250 

•Cl 70 

•6116 

•030 

•7825 

*7791 

•7747 

•OOO 

•4301 

•4233 

•5295 

•076 

•2280 

•2215 

•2269 

•038 

•5256 

*5212 

•6193 

•084 

•2089 

•202(5 

•2726 

*102 

•0640 

•0522 

•0582 

*050 

•1985 

*1953 

•1995 

•108 

•0893 

•0859 

•1219 

•128 

•0093 

*0093 

•0118 

•064 

•0399 

•0391 

*0427 

•132 

•0346 

•0334 

•0493 

•154 

•0012 

•0013 

•0020 

•076 

•0074 

•0073 

*0087 

•156 

•0123 

•0122 

•0186 

•180 

•0001 

•0002 

■0003 

•090 

■0008 

•0008 

•0011 

•180 

•0041 

•0043 

•0066 





*102 

•0001 

•0001 

•0002 

*204 

•0012 

•0014 

•0023 









•228 

•0003 

•0005 

•0008 









•252 

•0001 

*0001 

•0002 



1 







ordinates being computed at intervals for f of 003 in the first case and of *002 in 
the other two cases. The results are shown in Table III. The adequacy of Method I 
for the common purposes of this test can hardly be questioned; Method II is less 
satisfactory, particularly for the sample of 102, but in all cases the agreement 
will be better as the number of arrays is decreased compared with the size of 
the sample. 


24—2 
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(7 ) The Distribution of the Correlation Coefficient 
in the Experiments. 

The sampling distribution of r first obtained by R. A. Fisher in 1915* is for 
two normally correlated and continuous variables. The population distributions of 
Experiments 1 and II are neither of them of this form. In the first case the y-arrays 
are normally distributed and contain five groups to the standard deviation, but there 
arc only three alternative values of x> — 1 , 0 and + 1 ; further, the proportions in 
these three ^-marginal totals are p x — *40, p 2 = *35, p 3 = *25. That is to say, the 
^-distribution makes no approach either to normality or continuity. For Experi¬ 
ment II the y-arrays are skew curves with varying standard deviations, while there 
are five values for x ) with proportional frequencies in the ^-margin of pi = *l667, 
p 2 = *2666, p 3 = *2167, p 4 = 1833, p B = *1667. Here again there is no approach to a 
continuous normal distribution. Let us examine how closely the observed distribu¬ 
tions of r conform to the sampling distributions of " normal theory.” 

TABLE IV. 

Distribution of the Goiirelation Coefficient. 


Experiment I 

Experiment II 

r 

(Central Values) 

Observed 

Frequency 

Normal Theory 
Frequency 

r 

(Central Values) 

Observed 

Frequency 

Normal Theory 
Frequency 

-•05 

1 

0*8 

+ •02 

2 

2-2 



(- *05 & Wow) 




•00 

— 

0*7 

+ •06 

1 

1*6 

+ *05 

— 

1*1 

+ •10 

3 

2-7 

+ *10 

1 

1*8 

+ •14 

1 

4-1 

+ * 15 

3 

2*7 

+ *18 

6 

6-1 

+•20 

5 

4-0 

+ *22 

8 

8*8 

+ •25 

8 

5-8 

+ *26 

17 

12-1 

+ •30 

6 

8-2 

+ *30 

15 

16-0 

+ •35 

15 

1 l-l 

+ •34 

20 

20-4 

+ •40 

24 

14*4 

+ •38 

18 

24-8 

+ *45 

19 

18*0 

+ *42 

31 

28-7 

+ •50 

15 

21*4 

+ •46 

30 

31 *5 

+ -55 

21 

23-8 

+ *50 

37 

32*4 

+ •60 

19 

24-5 

+ •54 

33 

31-0 

+ *65 

26 

22*8 

+ •58 

28 

27*2 

+ •70 

21 

18*5 

+ •62 

19 

21*4 

+ •75 

13 . 

12*3 

+ •66 

15 

14-8 

+ •80 

1 

6-0 

+ •70 

5 

8-6 

+ *85 

2 

2*1 

+ *74 

5 

4-0 



(+ *85 & above) 

+ ‘78 

5 

1*3 




+ •82 

1 

0*3 






(+ *82 & above) 

Total 

200 

200*0 

Total 

300 

300*0 


Biometrika , Vol. x. pp. 507 et seq. 
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Experiment 7. 

Here 20, p *■ *6346 {population coefficient of correlation), and the theoretical 
distribution can be obtained by interpolating between the columns of ordinates for 
p »‘5 and *6 given in Table A, p. 396, of the Cooperative Study on the distribution 
of r*. Second difference interpolation was used and a correction made to obtain 
group frequencies from mid-ordinates. The observed and theoretical results are 
compared in Table IV; the Goodness of Fit test with 11 groups gives P*‘223. 
The following comparison was also made: 

Mean r: Theory 5244, Observation ‘5160, Standard Errorf *0120. 

<jy: „ ‘1704, „ *1614, ,, '0097. 

These two quantities are somewhat less than the “ normal theory” values, but the 
differences are less than the standard errors. The frequencies show some irregularity 
in the centre, but the numbers are not large enough to prove any significance in 
this. 

Experiment II. 

Here N = 30, p = ’4626. We are now beyond the range of tables of ordinates 
contained in the “Cooperative Study.” The theoretical frequencies given in Table IV 
were calculated with the help of Fisher’s transformation by a method which will be 
described below. The agreement between “normal theory” and observation is 
excellent, the Goodness of Fit test with 14 groups giving P = -916. Further, we have 
the following comparison: 

Mean r: Theory *4563, Observation *4631, Standard Errorf *0086; 

(T r : „ ‘1488, „ *1475, „ *0064; 

the differences being again less than the standard errors. 

These two series of results are of considerable interest and suggest that the 
normal bivariate surface can be mutilated and distorted to a remarkable degree 
without affecting the frequency distribution of r in samples as small as 20, The 
^-distribution in both cases has been made platykurtic, and it is possible that less 
satisfactory results would follow if the surface were pulled out into a more lepto- 
kurtic form. 

(8) R. A. Fisher’s Transformation of the r-pisTRiBUTioN. 

This method of transformation, which has been referred to in the preceding 
section, appears to be of such value in small sample work that it seems worth 
recording here the following examination of the degree of approximation involved. 
The equation for the distribution of r in samples of n may be written % 

/(r)dr-constant x(l-r»)!T {^J^jdr.(52). 

* Bimetrika , Vol. xi. p. 898. 

t The standard errors are caloolaled as described in the footnote to p. 845. 

% Biometrika, Yol. x. p. 511. 




(53) , 

(54) , 
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Then the transformation 

r = tanh z, p = tanh £ . 

or z = \ ( lo g« (1 + r) - loge (1 - r)}) 

+ p)-log„(l-p)}j 

applied to (51) is such as to give for the distribution of z a close approximation to 
a normal curve with mean at f and standard deviation equal to l/Vn- 3 . That is 
to say, the distribution of z is almost invariant in form with a standard deviation 
depending only on the size of the sample and not on p. The moment constants of 
the distribution of z have been given by Fisher in the form of series in inverse 
powers of w- 1 *, and it is seen from these that the approximation is likely to be 
least satisfactory when p is large and n is small. The results shown in Table V 
have been computed for samples of 10 and of 20 from his series. Mean z differs 
from f by a quantity of the order of p/2 (n - 1 ) and is the most variable of the 
expressions tabled. 1 /Vw — 3 is seen to be quite a good approximation to a Zt at any 
rate in samples of 20 , and if the distributions are slightly leptokurtic they are at 
any rate symmetrical. 

TABLE V. 


Moment Constants of Distribution of z. 



These results do not of course show whether sufficient terms are given in 
Fisher s series to insure convergence with n as low as 10 , but it is possible to test 
the adequacy of the assumption that z is distributed normally in another way. 
Two tests were carried out. 

lest (a). The moment coefficients of the true theoretical distribution of r are 
given as series in the Cooperative Study f. Taking a sample of 30 and p«’462 579 
(as for Experiment II above), the following values were obtained: 

Mean r * *456 265; (r f = -148 818; ft *-244; ft » 3*252 

# Metron, Vol. i. Part iv. pp. 18 and 14. 
t Biometrika , Yol. xi. equations (xx), (xxi), (xxv), (xxvi). 


,( 55 ). 
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Using Fisher's series from Metron we find 

Mean * * *508 60; = 192 04 (N.B. l/s/T^S - 1925); & - ‘000 001; & * 3*0742 

.(56). 

Next values of r at intervals of *02 were taken between — ‘24 and + *86, and 

the corresponding values of z found from (54)*. The chance of a value of r 
lying in any of these subranges is the same as that of z lying in the corresponding 
subrange. We assume that z is normally distributed about *508 60 with standard 
deviation *192 04, obtain the proportional group frequencies from Sheppard's Tables 
of the Normal Curve, and hence have the grouped frequency distribution of r. The 
“normal theory” frequencies in the final column of Table IV above were obtained 
in this way. The moment constants of this distribution, were the process completely 
accurate, should be those of the series (55). Actually they were found to be 

Mean r«’4560; oy~* 1489; ft = ‘229; 3*175.(57). 

The agreement in the betas is not exact, but the z transformation seems to provide 
a quite adequate representation of the distribution of r. 

TABLE VI. 


Distributions of r. 



Chance of r lying below values 
shown in 1st column 


From quadrature 

From the z 

r 

with true ordinates 

transformation 

-•6 

•000008 

•0000003 

— *4 

•000067 

•ooo on 

-•2 

•000 36 

•00013 

•0 

•0016 

•0010 

+ *2 

•0062 

•0054 

+ •4 

•0249 

•0255 

+ *6 

•1037 

•1109 

+ *8 

•4431 

•4509 

+ •85 

*6165+ 

•6193 

+ •90 

•8133t 

•8130 

+ *95 

*9677t 

•9688 


Test (b ). Suppose a sample of 10 taken from a normal population with p = *8. 
In this case the distribution of r is included in the Table A of the Cooperative 
Study ( loc . cit. p. 386). It is seen to be a very skew curve with a modal ordinate 
at about r = *85, and fit *■ 3*1377, * 8*0534. Clearly it is not an easy distribution 

to handle, and but for these tables of ordinates we should be in difficulties when 
wanting to find the chance of r exceeding a certain value. The second column in 

* A table of this function is given at the end of the Metron paper. Only about X sample in 10,000 
lies outside the range r= - • 24 to + *80. 

t These values cannot be quite accurate as the r curve is too abrupt for a satisfactory quadrature 
from the tabled ordinates. 
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Table VI has been formed by applying quadrature to these ordinates. The e 
transformation leads to a distribution whose moment constants were calculated in 
forming Table V; they are 

Mean 0455 = 11441; <r,-'3691; & = -000005; 32808. 

Now make the simplifying assumption that z is normally distributed about 1T441 
with a standard deviation of '3691, and it is easy to find from Sheppard’s Tables 
the chance of z = tank -1 r exceeding any given value. Is the approximation 
adequate ? The figures in Table VI suggest that for most purposes it is. It must 
also be remembered that in taking n = 10 and p = - 8 we have chosen a most 
unfavourable case. 

The author is very much indebted to Miss M. Page for the sampling and 
computing work for Experiment I; to Mr A. E. Stone for the sampling and Mrs 
L. J. Comrie for the computing for Experiment II; to Mr Ernest Martin for the 
computing required for the comparison of Tablo III; and to Miss Ida McLearn for 
the three diagrams. 



INEQUALITIES FOR MOMENTS OF FREQUENCY FUNC 
TIONS AND FOR VARIOUS STATISTICAL CONSTANTS. 


By J. SHOHAT (JACQUES CHOKHATE). 

Introduction. The object of this paper is to derive certain inequalities for 
moments of frequency functions, and to show their applications, in particular, to 
the generalization of Bienaym6-Tchebycheff’s criterion in the Theory of Probability. 
The following notations will be used: (a, b), finite or infinite, for the interval of 

rb 

distribution (6 > a); F{x) for the law of distribution, so that dF (#) = 1; 

J a 

(i, for the «th moment of the distribution about the origin, or 

lh~ f*afdF(x) (s=0, 1, 2,...; *-l)...(l); 

P x e,d = P : [c £ x £ d] for the probability that the variable x satisfies the inequality 

c£x£d\ E(f)«= fdF to denote the expected value of/ (x). 

Ja 

Wo shall use extensively Stieltjes’s integrals, the advantage being that a single 
formula embraces the cases of a continuous, as well as of a discontinuous, distri¬ 
bution'. The function F ( x ) introduced above is non-decreasing in (a, b) and varies 

monotonically from F(a) = 0 to F(b) — 1. The case of a continuous distribution 

* 

corresponds to the assumption dF (x) = f (x) dx, F(x)= I f(x)dx, where /(«) 

“ the frequency function ” is integrable on (a, b). 

1. Fundamental inequalities. The basis of our discussion is formed by the 
following inequalities of Tchebycheff and Holder which the writer has extended 
elsewhere* to Stieltjes’s integrals: 

(A) f df.l ftftdf J | . [ f t df, 

J a J a J a J a 

1 8 f-1 

<B) J* uj. i#s [/*i/,r«+] : .[/‘i/. 1 '“<>+Y (,>1) - 

In (A) (Tchebycheff) and in (B) (Holder) ^(x) denotes a monotonic non¬ 
increasing function, fi,t(x) are two continuous functions, which in (A) both vary, 
for a £ x£i>, in the same sense (sign >), or in the opposite sense (sign <). (B), with 

/«(«) = 1. l/i(«) I s I/(®) I* 1 . ^ («* > gives 

h 

(C) /Vf 1 [/j/!'’#]'-[/V] ' (*.>>,>0). 

We notice that Schwartz’s inequality is a special case of (B), for 8 = 2. 

* J. Chokhate, “Bar let integrates de Stieltjes," Comptet rendu*, T. cutxxa. (1989), pp. 818—690. 
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2. In order to illustrate at once the importance of the above inequalities, we 
proceed to show that they yield directly many important results the proof of which, 
otherwise, requires special considerations in each case. 

(i) £ denoting an arbitrary constant, take in (C): f(x) = x — £, ^ (a?) - law of 
distribution F(or). This gives 

1 1 

/* |*-f I* 1 5 [£ I* - f I 8 ’ dF(x)J* (s t >>0)...(2). 


Hence, tfAe quantity v t 


-r/%- 


Zl'dFix) 


increases with s. This general 


property leads to many interesting results, by specifying s and £. 

l 

(a) Take £ = 0: increases with s for any distribution over any interval , 

l 

and so does , more genei'ally , fi 8 *, in case a £ 0. 

09) Take in (2) £ = arithmetical mean of the values of x, and denote by y! k 
the &th moment of the distribution about the mean 


(/* a#) r 


•JJr-;, >1 (« > r); /3 M = ^ > 1 (*-2, 8. ...(3). 

(ii) In (A) replace yfr(x) by the law of distribution F(x) 

E(fi/*)ZB(fdB(f,) .(4), 

. K (/”) > {E (/)}*.(5)*. 

(4) holds for any two functions j\ 2 (#) continuous in (a, 6), provided they both 
vary, for a* x£b, in the same sense (sign >) or in the opposite sense (sign <); 
(5) holds for any / (x) continuous in (a, b ). 

The following remark is important. Suppose we are dealing with a discrete 
distribution, x attaining a finite number of values x ti x %y ..., x 1n . Then Stieltjes’s 
integrals reduce to finite sums, for example, 

fbm 

f (x) dF - % f (Xi) ffi [tr< = F (X{ + 0) — F(xi - 0)1 

•’a i=l 

and the condition of continuity of f(x) evidently can be omitted. 

(iii) Integrating by parts the expression for fi 8> we get 


By (A) 


p, = JjF* (x) j —aj x^ 1 F (x)dx = b" — 8 j x K ~ 1 F(x)dx. 

[ dx. f of* 1 F (x)dx> f x "- 1 dx. f F (x) dx, 

J Q> J Cl J A J A 

J F{x)dx [(a, b) finite, aSO; «= 1, 2, ...] ...(6). 


* G. Bohlm&nn, “Pormulienmg und Begriindung zweier Hfilfsfl&tze der mathematuschen Statistik,” 
MathematUche Annalen , Bd. lxxxv. (1913), pp. 841—412 ; pp. 374—5. 
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For a symmetric distribution over a finite interval (— a, a) similarly 

pt* < 2a 2 ® — 2a 2 *” 1 ( F(x) dx (s = 1, 2, ...).(7). 

Jo 


(iv) Let a £ 0. In (A) introduce the non-decreasing function 


at dF (x) (k £ 0), 


where F(x) represents the law of distribution over (a, 6), and take fi(x)^xJ f 
ft (on) s x m (l, m positive integers or zero). This gives 


PkPk+l+m > Pk+lpk+m 
Pl+m ^ Hi Pm \ 


(k, l, rn « 0, 1, 2, 


l f m > 0)...(8). 


(8) holds for any distribution over any interval (a, b), provided a^O. 


(v) Finally, we derive, by means of (A), two inequalities which we shall 
frequently use in our discussion: 

(D, fV/ W< bi r ^=^i F) .|Vw* «>£„<», 

where f(x) is non-decreasing (sign >) or non-increasing (sign <) in (a, £); 

(e> (»=.<« 

where / (x) represents the frequency function over (a, 6), with a£a< fi &b. 


3. Continuous f Vshaped symmetric distribution over a finite interval . Here 
the law of distribution is represented by dF (x) = / (x) dx y where f(x) is an 
even function in the interval (—a, a) with a single maximum at a? = 0. Thus 
x (the mean value of x) * 0. Hence 


[ f 0*0 dx — \, p2fi—l 
Jo 

*0, Ht, = 2 [ a?* f (x) dx (*=1,2,. 

Jo 

(9). 

Applying (D), we get 



a 2 * 

W ' < 2» + 1 

(. = 1,2,...). „ = . 

....(10). 


Now let f(x) be subjoct to the following conditions: 

(I) x M f(x) increases in (0, a) for a certain positive integral k [example: xPe"^ 
in (0, 1)]. We notice that (I) is satisfied a fortiori for any V >k. Thus, we can 
apply (A) to 

x Sit ^.x ak f(x)dx ( 8>k), 

Jo 

a 2*-2* 

. Mi ° > 2s — 2k + 1 m ( 8>k >.^ 

We can go further and find the asymptotic expression, for s -*■ oo, of fit,, if 

(H) in a sufficiently small interval (a — h£x&a) f(x) * (a — x) ¥ q (x) [v > 0, 
q (a) 4= 0, q (x) continuous in (a — 8, a)]. 
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Then, as it has been shown by the writer*, 

The above inequalities (10, 11) enable us to find a lower bound for a in case 
the distribution is known to be of the type under consideration over a finite 
interval of unknown length (2a), 

a > V(2s + 1)/**, (s=l,2,...); a >a V3...(13), 


■<f 


(2s - 2k + 1) fit, "12* - 2 k 




J 


[under condition (I); $>&]...(14). 


4. Continuous fl -shaped asymmetric distribution over a finite interval. Again we 
choose the origin at the maximum. Write 

r o rb 

fijc — f affix) dx + f (x) dx ( a<0<b ), 

J a Jo 


and apply to each integral the inequality (D), 

c 2 " 




2s +1 


(s=l, 2, ...); <r<cV3...(15), 


„2»-l 1 , 211-1 -2«-l 

< M*-i < "27 . | Mas-iI < 2 j- [c = max. (|a|, 6); s = l, 2, (16). 

5. U-shaped continuous distribution over a finite interval. 

(i) Symmetric distribution Taking the origin at the minimum and using the 
same notations as in § 3, we get (inequality (D)) 

,, 2 * 


fl2s> 


25+1 


(s=l, 2,...); <r>-g?...(17). 


Here certainly atf(x) increases in (0, a) for any k S 0. Hence 


',2s—2k 


Htt > , 


28 - 2k + 

(ii) Asymmetric distribution. By (D) 

a**f f(x)dec + t 2 * [ f(x) dx 
J a Jo 


j (®i k —1,2,...; 8 > A?) .. .(18). 


Hss> 


H2s > 


d 2 * 

2s+ 1 


2s+l 

[d = min. (| a|, b); s = l, 2, ...] ...(19). 


(For pto+i we get, by Schwartz’s inequality, the less satisjactcgy result 

I Mt»+i I < ’JfMs+t < c 2 ** 1 [c = max. (| a \, &)].) 

# J. Shohat, 44 On the Asymptotic Expressions of Certain Definite Integrals/ 1 Annals of Mathe - 
maticty Vol. xxvn. (1925), pp. 8—11; p. 6. 
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6. Generalization of Bienaymt-Tchebycheff’s criterion. The problem can be 
stated as follows. Given two constants h(> 0) and £, find bounds for the probability 

. l 

PsP:[|® — £ |2 A),tn terms of the quantities v,= \x — Thesimplest 

procedure generally adopted is the following*. We write 

v,'-f |«-f| *dF+f |® —{|*dP = »x + t* (s> 0 )...( 20 ), 

and then, neglecting entirely t'i, 

n, , Zi i >h‘(l -P), 

P«P:[|*-f|*A]>l-g)' .(21), 

P = P:[|®-f|SX Vr 3>l-igy («, r > 0) ...(22). 

The very procedure shows that the limitations (21, 22) are, in general, too 
gross. In order to get a better understanding of these inequalities, we give the 
following properties of v t : 

1°. 1 *a# p 0 £ i>i ^ v% £ ...» for any distribution over any interval. 


*+i 


2°. limi/ tf =lim — x — = M{ = max. (|a —£|, | 6 — f |) [Jiff ® oo , if (a, 6) be 

*->ao Vti 


infinite]. 


3°. lim >1; lim = x (s-»- x; r > 0, fixed). 

4°. lim ^ < 1 (r x; s > 0, fixed). 

(1°) has been shown above (formula (2)); the first part of (2°) has been 
established by the writcrf. To complete the proof of (2°), we notice that 

j/,* 8 £ v,_x“ _1 . v„ + i‘ +l ; - ~ £ + — (Schwartz’s inequality) ...(23), 

v 8-l v * 


**+l 


# 4-1 


hence, lim exists and is equal (by a well-known proposition) to lim v 9 , 

#—►<» V 9 t-+ 00 

Furthermore, if (a, 6) be infinite, then, for a sufficiently large s (since 

lim ["[ cJF(®)]*s»l, 

*-►00 J 


«+l 




M( (mean-value theorem) ...(24); 


* Of. Pearson, Biometrika , Vol. zn. (1918—19), pp. 284—296; Naromi, ibid . Vol* xv. pp. 245—254; 
Goldberg, Compte* rendu*, T. olxxv. (1922), pp. 418—420, 679—680, 1382—1884; Lorqoin, ibid, 
pp. 681—688; Oamp, Bulletin of the American Mathematical Society , Vol. xxyixi. (1922), pp, 427—432; 
Meidell, Comptea rendu*, T. olxxv. (1922), pp. 806—808; T. olxxyi. (1928), pp. 280—282. 

t J. Shohat, 4 4 On the polynomial and trigonometric approximation of measurable bounded functions 
on a finite interval,” Mathematieche Annalen, Bd. on. (1929), pp. 157—175; pp. 168—4. 




.(25). 


306 Inequalities for Moments of Frequency Functions 

where 0 > 0 is arbitrarily large, but fixed) 

v § z\[ \x-Z\»dF\* ZiG . 

L'|ir-£I«2G J 

(3°, 4°) follow directly from (1°, 2 U ). 

The aforesaid considerations show that if we take in (22) r < s, the coefficient of 
1/V is > 1. Moreover, with r fixed , the usefulness of (22) generally decreases , as 8 
increases , for then the interval of admissible values for X 

M ( 


- < X < 

V r V r 


.(26) 


gets smaller and smaller; for s very large, \v r must be taken very close to and 
(22) loses its meaning, whether (a, b) be finite or infinite. On the other hand, if 
we take in (22) r>s* y the coefficient of 1/V is <1, and (22) becomes applicable , even 
with X< 1. Moreover, as r increases indefinitely, s remaining fixed, the interval of 
admissible values for X approaches, in case of (a, b) finite, a limiting interval 
1), whose length is different from zero. In case of ( a , b) infinite, the upper 
bound for X is oo, and its lower bound approaches (under the said condition) 0. 

7. Criterion analogous to that of Bienayme-Tchebycheff for certain distributions 
over a finite interval . 

(i) fl -shaped continuous symmetric distribution (§ 3). We get from (10) 


*j+ i > 2 J, h )> 


2 8 + 


1^ 

2 s +1 


iV**s :[|»| * *»] > 1 - g--, (£)* 


sr. 

.(27), 

i (£)‘ . 

.(28). 


(27, 28) do not require the computation of p^ (being useful, of course, so long 
as their right-hand members remain > 0). 

Apply now the inequality (E) to the integrals i ly i a in the expression 

Pit — zf x 2 *f(x)dx + 2 f a^f{x)dx~i x ^i 2 (h< a)...(29), 

JO J h 

ip 

2s+l 


„2*+l _ /. il+1 

P *~ h ' h ‘ *• < /o“ , 


(28 + l)(a-hy 


.(30), 


_ h h (P+y- IP +1 - (a -A)(2s+ 1)^ CP* 1 -hP +1 -(a- h)(2s + 1) a 4 " 

*- < - a (uP-IP) ' < a (cP — IP) 

...(31). 

We thus obtain an upper bound for the probability P x ~ h ' h in addition to its 
lower bound given by (21) 

.(32). 


* For r=#, X must be greater than 1. 
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(32) becomes inapplicable for sufficiently large s and any h<a or for any s 

j. 

and Acpa. 8 *. but i n that cai)e ( 31 ) becomes applicable, for its right-hand member 
is then necessarily < 1. In fact, the contrary assumption leads to 

2 , h , 

P*. <-j < K 

x (2s 4-1)* 

while lim pa, 2 * — a>h. 

8-m-ao 

(ii) fl -shaped continuous asymmetric distribution. The results of § 4 give (see 
formulae (32, 15)) 

P: [| • | & h] > 1 - (?)* [o - max. <| a |, ft)]...( 88 ). 

In order to get an upper bound for P x ~ h ' h > write 

r-h r 0 rh rb 

pa, — I x u f(x)dx + I +1 4-1 = ii 4 -H 4-»s4- H (a<0<h< b), 

J a J-h J 0 J h 

and apply (E) to each of these integrals, then 

P-HK* + . H — ( 2 s + 1 )g 2 * 

* H - A* H-h» 


(u a + 

H = max.- 

\ a + 


hp+l JSs+l-6*8+1 


■h ’ b-h 
(iii) \J-shaped continuous symmetric distribution (§ 5). Here 


.(34). 


= x**f (x) dx + 2 j a?“f(x)dx 


h* D .. a *® +1 — A ** +1 

2s4-l * (2s4-l)(a-A) 


P*-*- A > 


a *«+i _ ; t 2 »+i _ ( 2 s 4 - 1 ) /ig, (a — h) 


(l-P a -*>), 

.(35). 


a («*•-/»*•) ." 

(iv) U-s/taped asymmetric continuous distribution. Employing the now familiar 
reasoning, we get from 

Pa. = J aPf(x)dx+J Jo 4 - j^: 
p*. > ^r n *."*•*+<rrri (1 -IV**), 


2s 4-1 
> JT + A* V 


2s 4-1 
■ min, 


a 8,+1 4 h* +1 

6*»+i _ 6*8 +i-|n 

a + h ’ 

6 -A Jj 


...(36). 


Here we can go further and discriminate between positive and negative values 
of x: 


fits > ( x 28 / (x) dw + [ a?*f(x) dx 
Jo Jh 

P x °> h > 


pp,h 4. . ^ fl _? i2g+ ?. (1 -P 0,h\ 

■* * “ /i. z.\ /o~ i i \ x * h 


2s4-1 "* * 1 (6-A)(2«4-l) 

6 * 8 +i _ 6 * 8 +i _ (J _ 6 ) (2s 4-1) pa. 


6 (&»» — A*-) 

P -M ^ ^ - 1 - “(" + *) ( 2 « + 1 ) 

* a (a®* 4- A®*) 


.(37), 


(37 ftis). 
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8. Distribution over an infinite interval Let the interval be (— oo, oo) (other 
cases can be treated similarly). We assume the existence of all moments to be 
used. 

(i) D-shaped continuous symmetric distribution . Here 

f f(x)dx = \\ pu-i = 0; = 2 f aFf(x)dx (* = 1,2,...). 

Jo Jo 

1 ft 

The ratio —. a?*f (x) dx varies monotonically from 0 to J, as t varies from 
Pta Jo 

0 to oo. Hence, there always exists one and only one solution a B of the equation 

^\ a ‘f{x)dx-^n t , (O<0<1) .(38), 

which leads to 

f <'«*/(») d*; = '**•/(<!:)<&.(39), 

4 s,“'-Tl- p ~"°‘ ,h y (E »4 STI <* = '■ 2. • ••)■ ■ •<«>■ 

Introduce now the probability P x ~ h >\ Then, if h^a 8} 

fits > 2 [ a?*f{x) dx + 2 f a?*f{x) dx > h* 8 (1 — P x ~ h ' h ) .(41), 

J 0 J h ' 

which leads again to the inequality 

* p-h,h ^ i _ P** ^ 1 a * (Af>\ 

h* 6 ' (2s +1) h» .™ 


and this, combined with (40), gives a very simple inequality for P x ~ h,h : 

1 

Px~ h,h >1---r (for P K ~ h,k £ P x ~ a,,a ‘, if a, = h) .(43). 

2s + 1+ ~0 

In order to use the above formulae for a given 0, it is sufficient to find an 
upper limit for a 8 (and similarly for a 8} b„ below), for ti > a 8 implies 

/;>/<.)*-1*.(«■>«>, *.4^4^. 

The simplest choice of 6 is 6<=> J, i.e. choose a, so that 


f ®V(a) dx S f ' aPf(x) dx .(44). 

■'a, JO 


2« + l 


P x ~ h ’ h > 1 - 


2 8 4 1 3 


(h £ tt.) 


(46). 
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(ii) Cl-shaped continuous asymmetric distribution. We choose again the origin 
at the maximum. The same reasoning as in (i) holds, with proper modifications. 
Given any 6 between 0 and 1, there exists a unique pair of numbers o, < 0, 6, > 0 
such that 


[ aPf(x)dx = 6 f aPf(w)dw; f aPf(x) da - 6 f ®**/(®)d® ...(47). 
JO JO J a* J -®o 

Hence, by (16) 

= \ |V(«) dw<\. ^ < \. (c. = max. [| o. |, b .]). ..(48). 

We can follow now the discussion of § 7, (ii) 




o." 


0'(2s+l)A*’ 


P* - *’* > 1 —• 


2*+l + 


(AS c,)...(49), 


n . H-~ (2* ■+1) _ H .- (2s +1) 

■ if,-A*» if, —A 2 * 


6 


(H. =max. [—) (a,< 0 < A < 6,)...(50). 


In case a 9 =—b t (which corresponds also to the symmetric case (i) above) 

n _ k ^b*«-h™-(b.-h)0(2s + l)p» ^b™-h™-(b.-h)ffo» 

± x < b, (6,** - A**) 6, (6,*" - A**) ( 

.(51). 


9. Let a general distribution over (— oo, oo ) be determined by the law of 
distribution F{x). Consider 


8,(A) = J VdP(«0+J*^dP(tf)>A*(l-P,-*-*) .(52), 

P x ~ h ‘ * > 1 - -r^, with lim 8, (A) = 0 .(53), 

nr *-><*> 


which, for h sufficiently large, is better than (32), provided we know, instead of 
/a*, the order, with respect to 1/A, of B t (h). This can be done in some special cases 
which follow. 

Continuous distribution , with the frequency function f (a) such that 
(III) f(x ). | x |* < M for | x | £ X (M> X > 0, k > 1—certain constants). 

Here we introduce 

$ (A) • !*/(*) + ["/(«) dx = \- P,-*.\ 

5 (*) -J* . | x I*/(<r) da + j* ^. «*/(*) d® < ( h “ A 

P*~ h h > 1 (under condition (III); A £ X) ...(54). 

Biometrika xxi 95 
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Notes . (i) (III) is h fortiori satisfied if 

lim f{x) |#|*»0 (&>1) .(55). 

I X | —*• *> 

Then in (54) lim M ** 0. 

(ii) If we replace (III) by a stronger condition 
(IV) / (x) < Me~ r \*\ v for | x | £ X (M, r, v —positive constants), 
then (54) is replaced by 

> 1 - ^ (h > max. ^1, Z, ; s > 0 arbitrary) ...(56). 

In fact, for such A = | x | necessarily e“ r W < - ( 


Editorial Note on the Limitation of Frequency Constants . 

(1) My impression is that most inequalities hitherto found are not of groat 
service in practical statistics; the limits are too wide. Some of the results reached 
by Dr Shohat are already familiar, others can be obtained by an analysis more 
customary with practical statisticians, and it is, perhaps, worth while considering 
them in connection with his memoir. 

I use the following notation. Let there be n values of a variate x , namely 
Xi, x 2 , x Sf ... x ny and let 

s p =*x 1 v + x 2 p + ... 4 -x n P, 

and = Spin be a moment coefficient about an arbitrary origin from which the x’& 
may be supposed measured. 


Let &,-a = ^ and Ar-i = 1 , 

f*2 

where /i, r = the rth moment coefficient when the arbitrary origin is taken about the 
mean. f¥ p may be used, when we put yl q for fi q in the above expression. 

(2) Lemma . The determinant A written down below is always positive. I owe 
the following proof of this fact, which had been otherwise brought to my notice, to 
Professor G. N. Watson. 



*1, 

S 2 

*i> 

#8, 

SB 

s 2r—% > 

*2n-l> 

Sgr 


*0, 

Si , 

St 


St) 

Sz 

tr 2r~t 

Km > 


Km* 


clearly. 


For z—v log f x | mast satisfy the inequality it holds for any z >0 if — < 1. 

TV TV 
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Expanding these determinants they may be written in the form: 

A *= S 8 x m ar a (Xp — Xq)P (x r —• a?p) ( x r «— # 9 ) 

/ p>? \ 

V*>, g#m/ 

=s A (fl?p — fl/7) ( Xq — a? m ) (# m — Xp) 

P+q+r 


x {- x p *~ % (x q - - V~* (ff w - a?p) - x m *- 2 (x p - #*)}, 

where the summation extends over all different sets of values of p, q> r, a set such as 
2, 7, 4 being regarded as the same as a set 2, 4, 7. It will be sufficient to prove 
that 


& = (a — 6) (6 - c) ( c - a) {— a ar “ 2 (b — c) — 6 2#wa (c - a) — c* r ~ a (a —6)} 

is always positive, for then every term of the above expression will be positive. The 
expression is symmetrical in a, b, c and remains the same if the signs are all 
changod. 


The possible cases are: 


a 

b 

c 


(i) (ii) (iii) (iv) (v) (vi) 

-f- + + — — — 

■+• -H — — — H” 

+ --- + + 


but by the second statement above (i) and (iv), (ii) and (v), (iii) and (vi) are really 
the same. It is sufficient therefore to consider (i), (ii) and (vi), i.e. a, b, c are all 
positive or two are positive and one negative. We have two cases then: 

(a) a>b>o, or (y9) a >b> 0 > o. 


(a) We may write 

k — (a — c) (a — 6)® (6 — c)® 

= (a — c)(a — 6)* (6 - c)® 


(a 21 '-* - &*»•-* _ 6*-» _ c*- a| 

( a—b b—c ) 

J a *r-s _ c 2r-3 + 6 ( a *-4 _ c ®>-4) 

+ 6® (a*'-® - c®*- 6 ) +...}. 


Since a > c, this expression is always positive. 


(/3) Here (a — b)(b — c ) (a — c) is always > 0, while 

a®*-® (b — c) + b* r ~ i (c-a) + c® r_ * (a — b) 
will be positive when c is negative, if 

ah (a* r ~® - 6* r ~ 8 ) + (- c) (a*'-* - &*’-*) + c®’-* (a - b) 

is positive. But since a>b, and c is negative, all three parts are essentially 
positive. 

25—2 
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Thus finally 


8u 

a> 


8 \ y 82 

82, 83 

Hr-1, Hr 


TT 


1 Ml M 2 
Ml Mi Ms 

Mar-a» Mar-i» Mar 

is essentially positive, or 

(* “ 7 r ) > ( 1 " Tv ) /3 *- 8 + ( 1_ %*) /9 '*- 4 . (i) - 

For the particular case of moments about the mean this becomes 

&%r-2 > #2r-3 + &r-4 .*.(ii). 

Hence it follows that 

#Br-2 > £0 + $1 + £3 + ... + &r-8.(iii) \ 

but # 0 = 1 , hence, if all the odd moments are of the same sign, it follows that an 
even j 8 will always be greater than unity. The relation for r * 2, 

£2 > 1 + fa .(iv), 

is very familiar as it gives the boundary to all frequency in the fi 2 plane. The 
frequency system then corresponds to the limit of the U-curvos, i.e., to two lumps 
of frequencies n x and 712 at distance 6, and lying on the line 0 2 — fix — 1 *= 0. 

We may ask whether it is the two lump-frequency which bounds the possible 
frequency when we consider other /9’s than fix, /9 a . We have 


Hence 


It follows that 


b z n\ri2 
(«! + «»)*’ 


Ms = 


= fr 8?t (»i ~ w,) 

(n 2 + «*)*• ’ 




O _ A**/**r-i _ (ni-n t )(n 1 * r - t -n»* r -*) 

Pir-S *= ,,-+.1 !/■« \ > 


N 


(njn*)'- 1 (», + «,) 


n 1 Sr_1 + n,’ 


»r—1 


P !__ 

* r_ * M* r (win*) r-1 («i + nt) ’ 


fitr -«’ 




_ 

M* r_1 («i + n») ’ 
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or the two lamps form again the limiting condition, beyond which the possible 
frequencies, bounded by 

fitr-t > @*r-i + &r-S, 

cannot pass. 

At the point & = 0, /9 t = 1, we have two equal lumps; all the odd yS's vanish and 
& - & = • • ■ - £*■-* = etc.«1, 
or all the even £’s take their limiting value. 

(3) For symmetrical curves with continuous convex curvature and range 2a the 
limit is the rectangle when we deal with the greatest value of any moment 
coefficient. Hence we must have 

M*r < a 8r /(2r + 1).(vi). 


or, for r = 1, 


i.e. «r< 


i.e. a rectangular distribution gives for such curves the maximum a (or radius of 
gyration) about the mean. 

(4) Now for U-shaped curves* the higher the moment coefficient the more 
closely its value will approach the two lumps moment coefficient and diverge from 
the moment coefficient of the rectangle, for'the higher moment gives more weight to 
the outlying values than the lower moment. Hence, if s be > k, 


2s +1 2k+ 1 


Mt * > (2*+ l)/(2fc + 1) ^ . (Vm)> 

This is a higher limit than that provided by Dr Shohat in his Equation (11). 
Since 2k +1 < 2s +1 because k < s or 2k (2k + 1) < 2k(2s +1), 

2s +1 < (2s + l)(2k +1) — 2& (2fc+l), 


2s + 1 
2 & +1 


< 2s + 1 — 2k 


(iu> 2s-2k+l (iik 


which is Dr Shohat’s Equation (11). 

/a\*-“2ife + l 


ftu-% > 


a<<r|i 


) 2s+l™- ; 

2s-H ft_ t ]»^ 
2* + l/5*-.J 


* For n-<n>rvM the sign ot the inequality most he reversed. 
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(5) Now consider the curve 


»-»( i -ST- 


where p must lie between 0 and 1. In this case it is easy to show that 


Mm 


sa* 


M«-i> 


and accordingly 
and 


8 + 2 - 2p 1 

1.3.5...(2$-l)a* 

(3 — 2p) (5 — 2p) ... (2s +1 — 2p) ’ 

Hh will be > ^**~ a 
Wlu 06 > „*«-a > 


a* 
2s+l 


2s-1 


since 


2s + 1 
2s + 1 - 2p 


> 1 . 


Accordingly 


NOW, /X2 = 


3-2!) 


a *»-2* < |li_? < ( 2s — 2A; + 1) ^ . 

2& + l/ia* M** 

, or we may put, as illustration, a = 1, p = \ } which give 
= 


Mfcr 


Mitt = ' 


= 1 / 2 , 

1.3.5 ... 2s — 1 


2.4 ... 2s 

1.3.5 ... 2£—1 
2.4...2A 


Hence 


, 2.4...2A 1.3.5 ...(2s + 1) 
a < 2.4 ... 2s ’ 1.3.5 ... (2A +1) 

(2s + l)!2 8 *(^!)» 


2 a *(sl) 8 (2A; +1)!‘ 

For example, if we work from the fourth and second moments, i.e. s = 2, 1, 

a 2 < 1*25, or a <1*12. 

Using Dr Shohat’s inequality we have 

a <1*50. 

In the former case we are 12 0 / o and in the latter 50 % beyond the true value. 
If we take 5 = 4, k = 2, we have 

a < 1*07, 

or we are 7 % above the true value. Dr Shohat’s formula gives a < 1*38, or 38 % 
above the true value. 

In neither case should I personally feel justified in using an eighth moment 
coefficient, as the probable error of such a moment is too large in the case of the 
usual sized sample*. 


* The safer #=8, &=2 gives a <1*08. 
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(6) If all the x’b are > 0, suppose them arranged in order of magnitude, then 
X p k X q k (x p l — x q l ) (x p m — x q m ), 
when p>q, will always be positive, and therefore 

„ S (x p k X q k (x p - X q l ) (X p m — X q n )), 

where 8 denotes that p and q are summed for every pair in 1, 2, 3, ... », but 
p > q, will always be positive. Accordingly: 

S X p k X p k+t+m + Sx p k X q k (x p l — X q l ) (x p m — X q m ) > 8 Xp k+l Xp k+m , 

P“1 P=1 

8x k (X k - ki+m + 8' x q k+l+m ) > Sx k+l (x*™ + S' x q k+m ), 

q+p q+p 

(x x k + #**+...+ X n k ) (Xl M + m + X % M + m -f ... + X n k + l + m ) 

> (x x k + l + X t M + ... + x n k + l ) (xi k+m + X* k + m + ... 4* #n* +m )> 
tifl k X Tlfl k+U-m ^ nfL k+l ^ fc+m» 


or A X *+*+m > p k+l X M'*+m ..(xi), 

which agrees with Dr Shohat’s Equation (8). 

If we put k~0, fi k *= 1, 

Pl+m > t*'l x fl'm .(xii). 


Unfortunately this is not demonstrated for moment coefficients about the 
mean, but only for variates algebraically > 0. It is true for moments about the 


mean, if k } f, m are even powers, or 

> tH(k+l)Wnk+m) .(xiii). 

M 2 (i+ 7 rt) > .(xiv), 

fiui+m-1) > • 

Thus we see if l = m = s, 

Pto -2 > 0 * 28-2 .(xv), 

Ps *-2 > i8Va > fas -2 .(xvi). 


Innumerable such relations may be deduced. But such inequalities teach us 
little. For example, in the case of the normal curve, (xv) and (xvi) tell us that, 
putting 8 * 2, 

3.5.7 > 9, or, that 35 > 3; 

and 3.5.7.9.11.13.15 >81, or, that 25025 > 1. 

Such limits are of small practical service. 


K. P. 









ON THE STANDARD ERROR OF THE MEAN SQUARE 

CONTINGENCY. 


* 


By TSUTOMU KONDO, Professor of Mathematics, Higher Commercial College, 

Yamaguchi, Japan. 


I. Introduction. 


(1) Let us consider a table of contingency and let n„ be the frequency of the 
p ,qth cell; n P ., n.„ os usual, the marginal totals of the pth row and gth column; 
N the total frequency and k, \ the numbers of rows and columns respectively, then 
the mean square contingency is defined by 


1 P Z K 
8 8 
p=I q-l 



V )7( W ^)1 * 


which can be transformed into the simple form 

p-\ q~l 


The mean square contingency is usually denoted by <p 2 and we have the following 
fundamental equation 


P=k q-\ 

s s 

p=l q=1 



,( 1 ). 


In the right-hand side of this equation if, for and n,„, their expected means w„. 
and n* are substituted, then we get an expression 


P~K fl = A 
8 8 
p=i ?= 


* / V \ 

i \n p .n. B / 


1 


•( 2 ). 


which is usually denoted by fa 2 , and problems about the mean and standard error 
of (fo 2 have been completely solved by Prof. Karl Pearson and A. W. Young*. 

But problems of the same kind for <p! 2 have not yet been fully solved by 
anyone, and I want, here, to consider these problems. 


(2) Now, for simplicity, let us denote by m, the contents of the «th division of 
the contingency table formed by the sampled population of size M ; and let n, be 
the mean or expected value of the frequency n, of a sample of size N which corre¬ 
sponds to m,; then, if p is the probability of an individual being in the s-class, 

m, n. 

P = M 0 T P m N' 

provided that the number of repeated samples is very large. 

Again, let Bn, be the deviation of n, from its mean, then 

«,-«, + Bn„ 


1Hmetrtka, Yol. xx. p. 916. 
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and it has been proved* that deviations Bn, arrange .themselves according to a 
hypergeometrio series of which the moment coefficients are given by 

to-XiNpq, Pi = XiX»Wpq(p-q), 

P* “ Xx N P1 (3*8^?? + X*), 


where 


i i N ~ 1 


X*'- 


1- 


2 (N-l) 
M -2 


.(3). 


and 


/, 2\L N-l/N-10. 9 \| 

i^Jj 1 M-2\N-2 + jf-3/j' 

N-2\ 

X 4 =s l_ 6 _ 2 ^___j 


But for sampling from infinite populations, to which I propose to confine my 
attention, we may put 

Xi“X* = X4= ! l an <l = 1 - ^.(4). 

From these formulae, for infinite populations, we can deduce f the following expres¬ 
sions for means which are fundamental equations in the theory of this paper: 


Mean (Bn,y = ”« ( J “ • 

Mean 8 n a 8n a , ® — *^jjr > 

Mean (8»„) 3 = n, (l - y) (l - y . 
Mean (Bn,)* Bn,. = - ^ (l - y . 


2 

Mean 8 n a 8n t »8n a » = 


Mean (Bn,)* = ft, (l - |l + 3n, (l - (l -1)}, 
Mean (8»„) 3 Sn 8 - - - jl + 3 (l - n, (l - , 

Mean (Sn,)* (&v) 3 - j 1 + (* “ y) {^-n,- n,.+ ^^-)J . 
Mean (Bn,y Bn,. Bn,.. “ - ( J ~ y) W, y Wr (l - y «*). 


and Mean Bn,Bn,.Bn,..Bn^>. = 8(1 — 


2 \ n,n,.n,..n,... 


N* 


.(6). 


* K. Pearson, Phil. Mag. 1899, p. 389. 
t These formulae are all giyea in Biometrika, Vol. xi. p. 917. 
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II. The Deviation of fa*. 

(3) The mean square contingency of the sampled population is given by the 
following equation 

2>=* g«A / fj 2 \ 

6*= S 8 (J^H-1, 

P =i g=i \n p .nj 

or simply by $* = 8 . \ — 1, 

where, and hereafter, 8 stands for the double summation 

ttaeff q — k 
8 8 . 
p~X gel 

Now let $<l>i 2 be the deviation of <f> j a from the population value $ 2 , and if we 
write as follows, 

Mean Sfa 2 = fx 1 ', Mean (S^ 2 ) 2 = Ma', 
then the mean aud standard deviation of are given by 

Mean say) = + Mi' .(6), 

and <r<n* = ^W-(Mi')*.CO- 

Therefore, if we can find ^ and we can at once find the mean and standard 
deviation of fa 2 . But 

\n P .n^J \n Pm n. Q J 

s\M- \(i + 8 My(i + ^) fi + w-ill. 

Lnp.n* (\ KpqJ / \ n P J\ n. a ) JJ 
And, unless the expression 

/ n . Sn w \ 2 // 8n P ,\ ( &n, q \ 


(l + /(l + (l + ->) 

V npq//\ n p .J\ n^J 


be transformed into a form of simple summation of differential products, it is 
difficult to find the mean values of Sfa 2 or (8<f> i 2 ) 2 

$Tl 8fl 

Now, in the usual cases, _ *' and -zr^ may be considered as less than unity and 

n P . w* J J 

we can expand the above expression into a convergent power series of the logarithmic 

differentials. 

As the expression for becomes very long and complicated, let us write 


<*i = 

$n p . ^ hn. 

a 

i 



d% — 

kh 

Sn P . 
n p . ' 

.^»+i 

n. Q 


II 

(!*)■+ 




\ Wp. / 

\n p . 

/ n. q 

n„. \, 


flpq 2 
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then 


n« 


n„.n. 


( 1+ fe)’ 

\ ^9Q / ' flp, Wp# Tip, / 

x('l-^s + ^i_^!i + ...’) 

\ n. a n. tf * n./ / 

= W f 1 + ^g? y f 1 — — jjj^ g ^g** _J_ ^ n *« fa eq ^ ^ n «q 8 __ ^p. B _ \ 

w \ *W \ rip. W, c Tip. 2 rip.n. a * r w. a 8 rip 8 ’/ 

= Wpcr ^1 + (1 — d x + d 8 — 4 di - ...) 

-«„{i-( (i .- 2 ^) + (i,-^^%‘)-..}. 

Sn„. $»„ 


Or, if we write again as follows, 


8 =d - 2 ^ r> ” : 
n«, 


-=-= + _ 

n,. n 


nK 
^ ri ’ 

•Q n VQ 

Stir ' 2 


and so on, then 


and 


therefore 


*.-*-«.<%)**(>? . :■(»). 


' U VQ (1 — ^1 4- Sj *" ^8 4 ^4 — • • •) 

if 7lpq2 ) 

\n v .nj 


n p .n. Q 

1 4 0i 2 «£( 

= 8 (Upg) + 8 [Upg (— Si 4 Sj — Sa 4 ...)} 
= 1 4 $ a 4 8 [upg (— S x + S a — Sa 4 ...)}, 


•(9), 


Bcf>i 2 = S [upg (— S x 4 B 2 — Sa 4 S* — ...)}, 


.( 10 ). 


Thus S^x 2 is expressed as a form of simple summation which is a convenient form 
for finding mean values, while S m is a homogeneous expression of order m, in the 


St? 

logarithmic differentials , — 

n«o w 


and of the form 


«m - dm - 2dm-l (^) + dm-a (^)’ 

= /H. m . H. ro_1 Sw^ , 8». a m \ 

U,. m n,."* -1 • n« + "■ + n^y 

/gn M \ ( Sn,.”* -1 | K”- 8 Sn* , 

\/ In,."* -1 ‘ n* T n.,"* -1 ) 

fSm,."* -2 8«,."*-« 8 b„ , , Sra.,"* - *) 

V n M ) \ n,."* - * + ' n* + + f ’ 
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But as this expression for 5</>i* is an infinite series, it is very difficult, someti mes 
impossible, to get any finite and exact expressions for pi and ft*'. What we can do 
is only to get certain approximate expressions for Mean 8^j* and Mean (&<f >and 
even then we have to consider the question of the degree of approximation to get 
any adequate estimates for practical purposes. 


III. Expressions for First Approximation. 

(4) Now, as the first approximation, let us retain terms of 8^ x * and (8<fn*) a only 
to the differential products of second order and let us use, for simplicity, a square 
bracket, [ ], as the symbol for “mean value in repeated samples”; then, from the 
Equation (10), we get 

= S {«„(-& + «,)}, 

and W) 2 “{--S(«„«!)}* ...(11), 

therefore Mi' = [S<£i a ] = S ([8 a ] - [Si])). 

Now [SJ = R_V + -•*-2 

L n v "-a n«rJ 

_ [jj»y] + Q.«1 _ 2 [&w w ] 

71 pQ 

But [Sn,.] - [8wJ = [&i OT ] = 0, 

therefore [8 X ] = 0. 


Secondly, let us consider the mean [8»], 

rg i = [H- s ] + [KJwJ + _ 2 [8w,,8w [8w w a ] 

* J n v ? n,.n«, nj \ n. Q n„ J S** ’ 

and the following equations can easily be deduced from the fundamental formulae (5), 

[K.*] = n„.(l-^), [«**•)-»„ (l-^), 

and [8n„. 8nJ = n„ - .(12). 

And, from these equations, we get 



-2 








>MiO).«ay. 


and 


(18) 
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5) Now let us consider the mean of 
From the equation (11) we get 

[W^-Mean (S{uM* 

and we have first to expand the right-hand side of this equation, 

<* i”- 8 *')’ - {* (*- fe)F + {* (“- t?)F +4 { s (*- £)F 


881 


+ »(«„ £) S («« £) - «(*. £)» («- £) 






Sn^,\ „ / 5n, 


- 4S (“»t?) s (“-ir).< 14 >- 


'.ff / X 

Let us write, for simplicity, as follows: 

t 

S ( u vq) = 
p 


P-k p**k / m 2 \ 

p ?i ( “ m)= pi 


and 


$ (U’ W ) 333 Mp., 
<7 = 1 


then K-fe)F-{J.!?Sv} : 

= ('"5* »>.&»».)* 
U-i n 9 . J 

(u p *8n p * 


= 5 +-Sf-S'(“"“^K.K'.) .........(15o), 

p V «*.* / p p' \n p .n P '. 9 ) \ /> 


P = * 


where 8 stands for 8 and S' is a symbol for the summation from />'= 1 to p'** tc, 

p |>=1 p* 

excepting the case p' 

For instance, 

&'( U v) ** w l. + W*. + ••• + W (f>-1). + 4*ti <p+ 2).+ • •• +w*.. 

p' 

Similarly S'(u«)** u.a +... + + tf.<*+i) + ... + 

9' 

We can also easily get the following expressions for the other terms of the 
expression (14): 


r ' tofM 1 « 


i a at 




\ 


a 
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I s (■- £)F- s +s ?' ces 

+ *f (S£ *'•*-) +8 ff GE£ 8 ”-M. (16 '' ) ' 

S (n„ ?>) 5 L - S &•*&.*) + SS' (££* 

\ ™ n p J \ " n^J \n M n p . w "/ y W»V« v 7 

.(15 e), 

s («» f;) * (*- £) - * (£!-;**>*”>) + ®f GiS **-M 

.(15/). 

It is necessary first to find the means [&?„.&?„#.] and besides those 

already given. 

But from the fundamental formulae (5), we can deduce at once 


[&•»&*.]=- • 


Now let us substitute the expressions (15) in (14), and take mean values, then 
from the Equations (12), (14) and (16), after simplification and transformation, we 
get the following expression for ^ as its first approximation: 

[(8<f>i 2 ) 2 ] = [(S (ttjxr Si)) 2 ] 

= 4 8 (^) - 3 8 ( U * ? ) - 85 (~A + 2 8 ( Up ^^) 

p \ n v J q \n. Q J \ h p .n. Q ) v 
= ^2 0 ), say.(17), 

which, as we shall see below, is of order ^ *. 

Now yu* = {<*v} 2 = /V(D - (Him ) 2 . 


b " 1 t (‘-yo-y} 

is of order ^ as we shall see later*, and therefore {/ii' a) | a is of order while 
yua'd) is of order ^. 

Therefore as the expression of first approximation for ytig, we may omit {y&i'a)} 2 
and we have 

^ -«(”e) - s ? fe’) - s ? (£) +ss (NSS 3 ).< i8 >’ 

»d M» tf .S W + 8 {i(l-y .■(“)• 


See Article (18). 
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Note (i). If = [n„ - j 

and be the contribution to the contingency from a single row and (f> q * from a 
single column, then we have, approximately, 

**f ■ 48 (*-' it ) +2S (*w it) - 3 f(0- 3S 0 • 

as has been shown by Prof. K. Pearson and Mr J. Blakeman*. 

In my notation 

I O 2 __ I A Wj|» J J A 

+ and ^ jf- 

If we substitute these values in the right-hand side of those authors* equation 
and transform, then it becomes 

48 (^) + 2/S - 3/8 - 3/S (V) = Pi (1) , 

xWp^/ \ n v .n<, J p \n v .) q \n^J 

as we should expect, for 

{o-^*} 2 « 4<^>x a [o-$ x ] 2 approximately. 

Note (ii). The expression (19) for the Mean </>i 2 can be transformed into the 
form 

Mean W ' ' f ~*> + (l 4)*' + S fe,) 

. Cf / Xpj 8 \ _ o ( Xvi (™p- + n -q) \ 

l«AV V n v ?nj )' 

71 71 

where Xpq * w** -“ contingency of the jo, yth cell. 

The Mean ^ x 2 , expressed in powers of Xvq in this form, was foqnd by Prof. 
K. Pearson many years ago (1910). 

Now Xvq is a relatively Bmall quantity, and as it is partly positive and partly 
negative, the 3rd and 4th terms in this expression are likely to be small, and the 
first two terms depend on X, k, N and $ 2 only. Therefore, if we consider samples 
of the same size drawn from the same population, the term which contributes most 
to the variation of Mean 0i a will be the 5th term, i.e. 


o (Xv q g Qv ^ n *a) \ 

V ni'nj r 


Let (7, X, Y and Z stand for 


(/t-JKX 1) S (~%£ 1 ) and 

N \n p .nj \n p *n<, 2 J \ »,.*«<* J 

respectively. 


See Biometrika, Vol. v. p. 196. 
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With this notation the sample value of fa* will be in the long run too large by 
an amount, which on the average is given by 

fr'^C + ^fi + X+Y-Z. 

We may now ask whether if we were to substitute the sample fa* for fa and the 
sample coll frequencies into the expressions X, F, and Z, we should get a reasonable 
approximation to the true /**', and further what is the relative importance of the 
five different terms in this estimate. To examine these points I have taken two 
sets of ten samples drawn from the population of Tables V and VII of Article (15), 
which will be considered in another connection below (Article (19)). I obtained the 
following results, the line above each table giving the constants for the population. 


TABLE I. 

2 x 2 -table, fa = -494 950, N = 100, C = -01. 


4> i a 

i* 

X 

Y 

Z 


•274 350 

•008 230 

•000000 

- -006618 

•oil 135 

•000477 

•401 296 

•012039 

■000407 

-•002 611 

•016 708 

•003217 

•440000 

•013 200 

•000 000 

- -001 649 

•014930 

•006621 

•460000 

•013 800 

*000 956 

- *000 609 

•020 234 

*003913 

•481 668 

•014 450 

•000 000 

- *000 399 

•020069 

*003 982 

•519 592 

•015 588 

•000 472 

■000 984 

■021 783 

•005 261 

•540000 

•016200 

*000 000 

*001 351 

•022 500 

006051 

•560022* 

•016 801 

-•000120 

•001 884 

•022663 

*005 902 

•600126 

•018004 

- *001 352 

•003966 

•023671 

•006 947 

•725 656 

•021 767 

-•000 206 

•006 768 

•029 420 

•008 909 


3 x 3 -table, fa = -188 893, JV= 200, -02. 


4i a 

N 9 ' 

X 

Y 

Z 

Ml' 

•080840 

•001126 

•000769 

- *001 608 

•002992 

•017395 

•150002 

•002 250 

•000502 

-•000 477 

•605 038 

•017237 

•172689 

*002590 

•001 445 

•000096 

•006440 

•017 691 

•184 136 

•002 762 

•000 713 

•000057 

*005 290 

•017 642 

•195403 

•002931 

•002 342 

•000243 

•006682 

•018834 

•206624 

•003099 

•000612 

•000622 

•007 456 

•016877 

*230464 

•003457 

•001819 

•001326 

•008 784 

•017818 

•249042 

•003736 

•002 469 

•001 533 

•009371 

•018367 

*281 469 

•004 222 

•001462 

•002 233 

•010076 

•017842 

•370 502 

•005 658 

•001053 

•003 642 

•012692 

•017561 


It will be seen that while O and the 


3 

N 


fa* and Z terms are the largest, X and Y 


being of a smaller order, it would yet scarcely be safe to neglect the latter. In the first 
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illustration we Bee that /t, is at most only about ^th of $*; in the second illustration 
/* 1 is about i^th of $*, but we see that it is remarkably steady. A larger amount of 
experimental work would be requisite, however, before we could interpret the exact 
meaning of these results *. 

(6) Numerical Illustration*. 

Now let us take'a fourfold contingency table, where the frequencies of the 
sampled population, reduced to sample size of 100, are as given in Table 11, and 
find the mean and standard deviation of <f> i*, applying the formulae (18) and (19). 


TABLE II. 


15 

25 

40 

40 

20 

60 

55 

45 

^v=ioo 


* [It may not be without interest to indicate the method used by me many years ago to correct 
contingency by the result: 

Mean 0,» = 6’ + + P+X+Y-Z. 

I argued that the mean fa 2 was unlikely to be far removed from the modal fa 2 of samples. Henoe 
although mean fa 2 was not the most likely value to be obtained in a sample, it was the best approach to 
such a value. We have then 

(Sample tf+Z- C-X-Y), 

where Z, X and Y are to be found from the sample. Our object is to find the correction on fa 2 which 


will give us a good value for the value fa 2 being too great. Applying this formula to 

our two tables 

we have for <j>* for the respective samples: 



2 x 2 -table 

8 x 9-table 


•278 789 

•068 517 


*898178 

•188 020 


•488 572 

•155 259 


•456 201 

•166 164 


•477 802 

•176 847 


*514 484 

*189 996 


•595 096 

•212 909 


•554 292 

•280 947 


•598 882 

•263 891 


•717 828 

•858 201 


Mean of 10 eorreoted fa 2 &<P 2 

Mean of 10 corrected fa 2 


* *495 411 -000 461 

•194 575 

■005 682 

Mean of 10 uncorreoted fa 2 

Mean of 10 uncorreoted fa * 


•500 261 -005 811 

•212 117 

•028 224 

Population value $ 2 

Population value £ s 


•494 950 

•188898 



We see that while all fa* are lowered by correction so that the values of fa* below are slightly 
worsened, the average on ten samples is very elose to the true value. I found, however, the corrections too 
troublesome to be made in the case of tables ranging from 4 x 4 to 7 x 7 oells, where they are chiefly 
needed.—Bn.] 
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Here \ = tc « 2 ; 

Hu = 15, %a=25, n 2 i = 40, — 20; 

Tii. = 40, n 2 . = 60, n.i = 55, n* 2 = 45. 


From these numerical values we get 




%i 102 273, Mi 2 

- -347 222, 



^21 = “484 855, Ui 2 

-•148148; 


Ui. « Uu 

+ nig = *449 495, u 2 . 

= •633003, 



^.i =‘587 128, t4.2 

= •495 370; 

and, consequently, 





8 = 
\tlpgj 

•012 494, S (^) 4 

■ $ (“«) = -023 450, 

,, \n^J 


p. u .qHpq\ __ 

•011 720, S(u M ) 

= 1 + ^ a = 1-082 498, 

\ 

n v .n. q ) 

and 

8 \~ m ( 

^ __ 7/ pa\ / j_V)l . 

^ Wj./ \ '^•q) j 

= 009 260; 

therefore 

P id) 

= ‘009 260, //-2 (i) =** 

003 066, 

and accordingly 


^= 091758, 


and 


o-^a = 0554. 



(7) The Case of no Contingency . 

Now let us consider the special case of no contingency. In this case 
Tipg = t for any j) and q, 

and, consequently, for any such population, we have the following special relations 


and 


m N ’ 

1 _ 

r 


Up. — S (u-pq) — **»•> 


U„/ N’ 

4^)4- ?(tV?(S34- 

s fc (* ■- y (> ■- y j 4 8 io -- t) (»■- 5?)} 

4 {«<*>■- *«&) - ?«( j ?) + a * s < 5 -4 


A r 


(X* — * — X -f 1} 


— N (* — ~ ' 


( 20 ); 
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therefore 

and 


Mi (i) —■ (# — 1) 1) 


Mac it - 4S (~) - SS ( l ^-) - 3S (?*) + 28 

\^PqJ p vip./ q \^<p/ \ / 


-i_ 6 1 

iV ■tf + 2V 

= 0 . 


( 21 ), 


In this case, as /a 2 ' becomes identically zero, we cannot get any estimate for the 
standard deviation of fa* from the formulae (17) and (19) of the first approxi¬ 
mation, and we have to find more exact expressions for and /i*' to get any 
adequate estimate of fa 1 and its standard error in such special cases. 

Therefore, we need next to find expressions for p{ and of one higher order 
of approximation. 


IV. Expression for the of fa 2 to a Second Approximation. 

(8) Now “order” in statistical meaning is not the same as order in mathematics, 
and in the evaluation of [&fa*] and [(8<£i 2 ) 2 ] some terms of the 2nd order in 
statistical meaning come from the 4th order differential products, and we have 
now to retain terms of 8fa 2 to the 4th differential products. 

Thus we get the following relations as equations for starting: 


8 fa 2 — S J (— 8 i -f 82 — 83 + 84)} .(22), 

and (&#>i 2 ) 2 = [S (m w 8 i )) 2 4 (S(u m 8 s)}* 

-2S(u w 8 1 )8(u„8 2 ) + 28(u„8 l )S(u„8 3 ) .(28). 

From the Equations (13) and (22), 

Mi' = t^i 2 ] - Mid) “ a [u„ ([S3] - [«.])! .( 24 ), 

and we have now to find the means of [83] and [S 4 ]. 


. [8m*. 3 ] [8n„.*$n.J [8w*.$V] . [$«•/! 

No» [8J- + + 

_ a 

( n*. 2 n P Ji. Q nj ) 

U J 


[Sripj 2 Sji *.] ^ [8m* 0 ! 8m. ( i ]| 


and, to find [S3], means of the following type must be found: 

[Sn* 3 ], [Sa^Sn,.], 

[871*, 8 m,. 2 ] and [Sn^Sn^Sn^]. 


.(25), 


36—2 
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But the following equations can easily be deduced from the fundamental 
formulae (5) and most of them are given in Biometrika, Vol. XII. p. 268: 

[*»,.•]-Up. (i-%?) 

[Sn p . a 8tt^] * ^1 — ^ > 

[S»„ 8 Srap.] = ««,(!“ y) ( 1 “ ^ ”«*) ’ 

[Sn^Sn,. 8 ] = iifQ ^1 — — jy , 

[S»„6»,.8» J - (* - (l - V) - % (*»'- ^-).<26)- 

Therefore, from (25) and (26), 


+GH) (&-»)+(£-»)(&-») 





N] \n.„ 

f m U VQ 

1 

ji p .n 

N 

1 \ 

(1 2\ 

~n) 

N) 


= (_ l - + ^ .(27); 

\n P . n* N) \n p .n« n p . n. 9 n^J 


secondly, 

[SJ - - 2 [<*. + [-** ^*] 

[8» p . 4 ] + [Swp^Sn^] + [Sup^Sn^ 8 ] + [ gwp.&w., 8 ] 
n „. 4 n p ?n^ «*.*«.,* n P .n 8 


«« 4 


■i-j 

I 


[Sa^Sn*. 8 ] + [&i M 8n,’fin.,] 


55 a 




[ grjptSwp .ga^ 8 ] [Swp,Sn^]| 

K* ) 




1 ([Sw^Snp. 8 ] [Sra^Snp.Sn^] [8**#,*^.,*]) t'9'i) 

+ K?{n p * + ~~T-n<r" + iQ~~\ . m 

and we have to find means of the following products of 4th order: 

[Sw,,. 4 ], [S»p. a Sn M ], [Sn^Sn,,, 8 ], [$n,. 8 $nj, [8n s 8 Sn^ 8 3, 

and [fin^Snp.finJ. 

But the expressions for the first three means are given in Biometrika, Vol. XII. 
p. 268, and we have here to find only the expressions for the last four means. 
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Let as, for example, find the expressions for the mean of Bn p *Bn^*. 

Now Bn p }BnJ m, Bn p .Bn^ (Sn w + S'8n w .) (Sn w + $'$«„»,) 

«' p' 

*■ + S' (8a OT Bn p ,Bn^Bn P >q') + S* (Sn N S)iy,Sn 4 Sti M <) 

p' 4 ' 

+ <S'S' (Sn J(1 8n. # 8n w ,.Sri 1 ,, # ).(29). 

p' ' 

But Sn M a Sn p .Sn^ = (Bn^ + S' 8n w .) (Bn^Sn^) 

9' 

= Sn^, 8 Sri* + S' Sri*,, Sn M a (Sn w + S' Bn p . Q ) 

9’ p' 

— Bn^Bn^, + S' (Bti^.Bn^ 8 ) + S' S' (Bn M a Bn p > t 8n M <) .. .(30), 

4' 4' P' 

and from the fundamental formulae (5) 

[&•*•&»«] - «p. (i - j 1 + » (l - |) n„ (l - **)}, 

[S"*"^]—^{l + 3 (l - |) (l - %?)} , 

[SrV&w&v*] = -(i-|) r ‘”^' e - (l -1 *„).(31). 

Therefore, from (30) and (31), 


[Sri^Swp.Sn J = -f S' [8w w 8 &i,**] 4 - S' S' [8?* w a Sn^Sw*^] 

q’ p' q' 

- a »( 1 - 5 i){ ,+ 3 ( 1 -l) 5 ”( 1 -y)} 

“ ^ («p- - «*») I 1 + 8 (l - n M (l - jj?)J 

— ^ ^1 — n M (n p . — rip,,) (n^ — ripj) ^1 — jy- n poj 

-»- i 1 - Y - w + ¥) + 3 *-■ i 1 - I) (>- t) (> ■- 
1 / 2 \ 

•" jf ^ ““ Jtfj n m (*** ™pq) .(®^ a )* 


Similarly we can deduce 

[Sftj>.Sti.g8n w &v fl ]*“ fipq n p*q jy^ (l "" ^ n *» ^ “1" jyi ^j>.w^...(316), 

= ^j>q n pq' ““ + jy?2 n v n *^ *..(31 c), 

and [&i*. Sn* &v a Sn M /] 

-fWW {‘ + (! “ y) (i - 7 - y - T + r.».-»<)}-( 31 -a • 






390 Standard Error of the Mean Square Contingency 


get 


From these equations, after substitution, transformation and simplification, we 
[$/?„.*&?„*] = «« (l - ^ “ n„) (m* - n„) 

+ (* ■- D i 2 *■•»« (■ - ¥) {' - n i)}-» 2 >- 


In the same way we can deduce exact expressions for the other three means. 

But all these differential products are of the 4th order and, as our present aim 
is to get approximate expressions for fix and to the 2nd order in statistical 
sense, some terms of them can be neglected in our calculation. 

Let us, therefore, first examine the statistical order of these means. 

Now let f 8 be the proportion in the sth population group, then 


/• = and if 8 = 


N 




— jv*”*('“ i\r){ 1+3 " , ( 1 if)(' j)} 

~f 8 (1 ~~f*) |j^3 + jfif* 

- 4 P (1 “ fs? + m/s (1 -fs) a - 6/. + 6/?) .(33). 


' jy2 J K JSf ^ ]\f3 

By the same method of deduction 


[ 8 /i 3 ] = ir 2 /«(l-/«)(l- 2 /,), 


and we can see that the second term of [8/* 4 ]> in the last expression of (33), is of 
one higher order in than the first term, or than [S/« 8 ], a mean of the 3rd order 
differential products, and we may therefore neglect this second term of [8/, 4 ]. 

Thus we have a very simple approximate expression for [8//], which is exact 
enough for our present purposes, namely, 

[S/* 4 ] = §if* (1 - /») 2 or [SO = 3n.» (l -|) 2 .(34 a). 

The following equations are all deduced in the same way and are fundamental 
approximate expressions, in this work, for the mean differential products of the 
4th order: 

[SOO = - 1 «,»«.<( l-J), 

[SOOJ - n,n, (l -1 -1 Mr), 
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[W&V&V ] «= - n,n,.n,.. ^1 - n.'j , 


and also 


3 

885 2^a n*n t >n*>'Tiz', 


.(346), 


[KK. J ] = 3 «,.n w (l-|) i , 

= iipq ^1 jyr) ^2w w + v yf > 

[Swj^&ip^&nj,.] = TipqTipg, ^1 — ~^j ^1 — n poj i 
[SWj^Swpj/Sri.pg./SHj,.] = — n^n^n^. ^1 — -y) . 


Now lot us find the mean [&a p . 2 8fr. a 2 ] again, and also other necessary means, 
starting from these approximate fundamental equations; then we get very simple 
expressions for them as follows: 

[SV^*] - K.n., (l - (l - + 2 (ii M - f ^ 2 


[Sw,. 8 ^, J = 3n„. (l - n j~^J (n m -”®^*) 

O^Sn, .*$«.,] = n„ (l - (in,* + n„. - ^ n v .n^j , 

id 

\&npq 2 &n P . 8r/^] = Bn^ 2 ^1 — ~~ npQ ^ npmUpq ^ ( n * q "" 

Now we can find the mean [84] easily. 

From the Equations (28), (34) and (35), 

w - 3 (i -»)’ +3 ii - »)’ +3 (c - y) - y) 


|...(35). 


+3 (s, _ y) (sfS; - y) + (t" y)(s; - y ) +2 ~ y) 

+ (s; - s) (£. + ij; -y) + (s, - y) (£. + ~ y) 


(1 

1\ 

1 _ 


u.„ 

n) 

~ jy rt J>« 

U«, «„./ 


—{*0c-i)'*»0Q-»)‘ + <s:-»)(at + ^-» 

+ (q-i)( 5 i + £-J)} 

- J- (l - ?*) fl - ?») (~ + J- + 2 — i) .(36). 

n M \ »„./ \ n^J \n p . n<, n„.n. q iv/ 
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Finally, from the Equations (24), (27), (36) and (13), and after transformation 
and simplification, we get the following equation for /*,' as its second approximation: 

-S {!•*([«■-[«.] + [«J)} 

. (87> 

V. Expression for the of <f> x % to a Second Approximation. 

(9) Now let us consider the second moment coefficient /**' of the mean square 
contingency <f > x 2 . 

From the Equation (23) 

M 2 * Ms'a> - 2 [8 {***) S (**«] + [(flf KS*})*] + 2 [^(^8,)0 («**)], 

but the last three terms on the right-hand side are not in the form of simple 
summations and we have at first to transform them into forms of sums of differential 
products. 

For instance, the mean values [Sw w 8 ], [8 ^ m s &i m /] cannot be treated formally as 
special cases of their general form in mathematical meaning, we 

have not only to expand the above products and powers, but also to examine, classify 
and arrange all possible products so that we can apply at once the fundamental 
formulae (5) or (34), or those formulae which I have deduced already. 

Now 

S (w m 8 i) S (u^Sz) 

- * (“" {* - 2 If I) 8 (“»{* - t + w 

= S («*<*,) 8 (u„d 2 ) - 2flf (v„ch) 8 ) 

S(«W« B („„ 5jSj) - 2 B 


48 (“» £?) * (*■* |?) " « (“- |f) 8 (*~ te*) 


Ti- 2 T t + T 3 ~ 2 T t + 4 Ti- 2T t> say 


2i (the first term of (38)) 
= S (u^di) S (u^dt) 


8 Ml: + If)} 8 {- (£ + ^-Tf + !?')} 

■ {f (“*-^) - f (“- Sf)} {f (“- fe*) 

+ <8 (u« ^4) + 8 8 (u M l 

v \ * «<,*/ p q \ M n t .n. a ) J 





■» oa 
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- f (-• tf) f (”»• tO + f(*-lf)? («• I?’) 

♦ f (**• t=) ?(*-t0 + ?(*< If) ? (”*• t?) 

♦? (»*• t') s (*- tit 5 ) + ? (“' If) 8 (“- v£) 

(”••’ to + f (*■>* I?) + f f (■“*•'t>tr') 

. a o / 8n^Sn^.\ , 0 / {&&,.*$»?.<, , &n p .8n<,*) 

+ SS' lU^U^, _ ' ) + <Sf m,.«< 1 T 1 -- T - a If 

, ln,}hn. 9 , Sh„.S«^ 2 \ 0 , ( 8n p .8n v ..8n«\ 

+ v»f + -** ~wr s ? m) 

+ SS- (»„«... ).(38a). 

Similarly 

r,- 

V, ( flpq'N'p. Mpq 1l pm H^f j ( i H'pq t ft,q t^pq^p.^ ,q )/ 

+ SS' U.u„ 

p' \ l tlpfqVlp'Vl^i J / 

+ B8' («,««. {*5t*f^*- + V^=»h.(38 ft), 

g' \ ( fipq*'ft.q l R’»q' Mpq*W'p,tt^[ )/ 

T,'rnt s (u u + u u 

■*-3 B^u,. n„'n v . + ) 

+ SS' (up.Uji'q B y . M *A + SS' («„«„. ^f) .. .(38 c), 
p'\ n P .n P , a * J Q ' \ * n^ripq, 2 J v 7 

T — sfu U I u u , u 2 

Tt-S^UpqUp. -—^- + UpQ u ^~^ t +H*. J 

+ 88 ' («„«„, + u Sn^a^ Sn^ 

o' v n^ripq. w " n^fip.ripq, ) 

+s f s ;(."V- t;t;.t y ) . 

+? £lf}) 

+ fifS' / ffig&Wjgg. + an TC 8 n,, > ^)\ 

+ ££' fu„u„, (awgS^jn^ K>wK.l\ 

+ 8 S’S' (u„up.,. + %^»'<L^'lV..(38 e), 

|>' g' \ ( ftpqfopfqfflp', j/ 
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„„d t. . s („„• g) + mt («„»„,*££) + aw 

+ss ; itfeO.^ 

From these expressions, it is evident that we have to find the following kinds 
of means, besides those already given, which are all exact and can all be deduced 
by the same method of transformation as in Article (8): 

1 / 2 \ ' 

= — jjr'Hpqlt'y'q 

\ f 2 \ 

= jjy- ( Wp-Wj )*q *4" 1lp**1l pq ““ Jiy Hp^lp'• , N'*qJ 

\ ~ / 2 \ 

= — -jy n>p.Mp’q ( 1 

1 / 2 \ 

[&?3j'fl'SWp.Sfi.g] = — ^ Jf n v n »QJ K .(39)* 


1 / g \ 

= — jy- Hp.'Hp'q jy Mp'qj 

x _ / 2 _ \ 

[Sw^&Vd " ~ jy - jf Up-) 

l / 2 \ 

and t qfal jy Wj'd | ?ip, “I* ^pq “ jy t W , p» ( N , »qJ 

Now we can easily find the means of any T-terms. For instance, let us find the 
mean of T%, 

From the Equations (26), (38 a) and (39), we get 

m = s (|^3 [K. 3 ]) + 8 (|* [««./]) + 8 S' fej-' [&*•&,,.]) 

+ ? s ; (St [ *- f M +1 ® f («3S; 

+ *?: ttSfc + * {S: J 

" ? K* fc. _ i) (v ~ #)} + ? {“■*’ (£ _ i) fc " y)} 

“ y ? ?' K v * (£ ~ 1)} _ y ® f {“•*“•"' (i, ~ f)} 

+8 {«».«., [(^ - #) ( ri ;|- - j) + Gq ~ y) Gdfc ” n )] 

+upqUp -1 ~ n) Grft ~^) + ~ f) _ y)} 
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If we use for simplicity the expressions in (40) below, then after transformation 
and simplification, we get a fairly short expression (41 a) for the mean of T x , 


S t ' = S + s(^)} 

p ) q \^tQ^ I 

^‘- s M +s M 

Si=s(^) + s(^) I 


p \'*p./ q 


[Ti] = *" -1 S,’ - J (l + ^)+~(l+ pf -1 8 

+ s j»E|VV(l_ + 1)1.(41 a). 

In the same way, we can obtain the following expressions for the other T-terms: 

[rj - w - |r ft' -1 &' (i + * *)+Js (i + £*) a 

+ . . (114) , 

™—y «■' + F- (> + *>> +s fe & + s;)} 

-» s © (1 + #,) . <«'>• 

[ZV]+ j 2 (l + ^-^V(l + ^) 

+s (d*K <1+ M£).•<“■*>• 

[2 1 .] — y - y # a + <£ 2 ) + ^a + £ 2 ) 2 

+ S fc(n7. + ^“^)}. (41e)> 

and [JVI = > + W 4< 1 + *‘> S (£) ■+* fe" (£ - y)} («/)• 
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Substituting these results in the expression (38), and after transformation 
and simplification, I have obtained the following equation : 

Mean 8 (w w 8 i) S (tt w 8 a) 

- - 3S -" 4 s -- 2S fe' o + ”4 - y s (£) 

+ S (~ + J- - ^)| + 4,S (J- + 

l HpQ J) vWjj, n»q/ j 

- 28 {V K. + uj + 8 { (1 + u„) (|»- + Ja)l .(42). 

(W VQ ) K n PQ ' U V» n «J ) 

(10) Transformation of the last two Terms of ^ for tf> i a . 

Finally, let us consider the terms (S (w ,*^}) 8 and 8 (w^i) 8 {^£ 3 } of the 
4th order. 

Now 


and 


(»-«)>- ( s 4 (* - 2* t w+ Wff 

-(S{u„d,W + * (s |vl £})* + (s f» 

- is [n„4| S \u„d t 5S®j + 2 S Iv jl,] s {»„ |j*} 


-4s( 


, few**, 

M w ttl ~ 


H“~^l 


.(43), 


= Tx + 47* + T, - 4T 4 + 2T* - 42V say . 

and S £ {m«,Ss] 

= 5 {«*<!,} S {« OT d»} - 28 {«„ |^J 8 {M TC da] 

- 2S M} 8 {[ydt + 4 8 (« M S {ujk 

+ 8 KA) S [u„d, £*}.- 28 {«„ %*} 8 js. A 

- ZY - 2 T t ' — 2 T 3 + 474 ' + Tt - 2 T 4 , say.(43 few). 

Among these twelve terms, let us consider the Zi-term only as an example. 

Now r,-(8KW-(s{«„ (*4 + %e.%* + %4)})‘ 

- ( s f- W + ( s f- ^f})‘ + ( s f- w 
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(? K l:;‘})' + (f f- W + *? K ? 1“-1?} 

- ? (*>■ It) + ? (“-■ v) +2s 

+ ss; + ss; .<«»>. 

(s {«. s 4^})’ - s {»„• ^5fi + s s' 

\ [ n p .n. Q )J { w fl p .*rc.„ a j p' { ** n v .n.*n v ,. ) 

«' ( n P *n. a n. a . ) v - s - ( ** P4 n p . 1 h. a n p ..n. a . ) 

and tho other two terms of (44) 

') &n v . t $n P '.8n. a \ 


oof 8n„ a 8n.„ 8n„.8v.. a 


n„. s ri.„ 


n P .n. a a 


. (44< * 

From these Equations (44), (44 a), (44 6) and (44 c), we get 


, ooi , 8n v .8n^ a o (/ s.o \ 

+26 r- w - “^17 5 i ( “ *•} 

+ 2«p.w p , ygfefc l 

I' 1 Kp.ni'.n,* * J 


+ SS 


? { * *' Bp.Rh,^ ) 

+ S8'8'{««V.—.(45a). 

i» # l 7ip,npsn^n^/ ) 
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Similarly 

r 2 =s{v( 

+ SS' | u m u v . t ^ 




Bn„ % BnJ\) 
n^ri^ /) 


+ 2 


+ SS 


WpQ Wp, 

8?? w , Slip. 8n 


W'pq'ft'p'q 7%p, Up*, 

&lpq&flpqlBTl^hl r l t q* ^ g 

£n«>. , SupqSnvqi&nqBn 


, &*!>» &«!><# $w<r*\ 
4 Z 


n pq* l p’q 


'g** 2 /} 


5 + ^pq^ixr^g ^l 


* S'{“««»'( 

4 $ /S 7 i?4 ti / 8>ijgfepv^W p. 8w|»*. ^ 8^yq Bti P > Q *Bu ^Sn ^A 

p* w 2,8 \ ^pqVipfqtiip.'Hpr, Upqii ptqtn^u^t ) 

+ 2 V «'w('^’-'”'^-*)(.(««, 

\ n^qUpg.np.n^q J) 


T __ G ,* 2 ^JX? 4 | i cf o' L. ,.. ^PQ Bllp’q 

Iz — O lUpq - 4 ^ + 00 8s; 8 

n vq J P \ n PQ n V'* 


+ SS 

Q‘ 


?' {“■ 


Bn^Bu^ 
^'njn^ 
'Bn„. a Bn p 


+ SS'S‘ 

V 


rs' \ u w u, 

' </ 


n OT a nj ,.„- 2 


.(45 c), 


r 4 =s j« M « p . 

( \ n P . 3 n w n v ?n. q n PQ J 

4 


4## 


(Sn T 
\ n 

^ /S' *|wp* Mp'gr ^ 


pq Hp? H. q 


8 iipqSnp^Bn.g &n Pa $Mp.hn.q 2 


Vlpq Tl p% 71 ,q 


, hi V ' Q ln v ?h\. y 

n v . Q n v }n v .. 


) (^5= + 

/ V ft.# 8 n m n. Q 2 n P . n m )) 

<i I ,. 0 Sn v . 2 Bn. Q \ 

^'g W,g ) 


... .„ (■<!&>'=>>■ . 

7MI rt V n m n p Ji. u n p .. /J 

+as- 

q' ( \ Hjw'H'tq Vlp. J 

/Bn^Bt^. Bu^Bn.g- + &t OT S//. () ,Sn P . 2 \j 

V n m n P .n. Q n. v . »!„».,■ m„.® /) 


+ m«,m« 


+ <S-S'-S' 


^pof 


4SS' 

v' 


/Btipq &ti p , 8/?p/. 8 ??,q• ^ 8wj^8fi,^ 8/ij,/, 8 ??<,uA| ^415 

8 \ np q n v .np*Ji.q. n vq n.qn V '.n.q. J ) 

w ( Bn^Btip . 2 bright . 2 ' t Bn Pq 2 Bn v ,Bn. 1 ^ 

l * npq 2 n p * ™ npq*n. Q 2 ™ n^Jip.n.q j 

( npi Q a fip. 2 ** 1,(7 n p , 2 n p .n. q J 

o/ ( SnpqPfoi.q 2 bupqfSnp.Bn.q] 

S iu.qilpq. —^-s-i 2 ~ 4 U VQ U VQ' —=r-+~~- - r 
q'{ npqSn.q 2 w w n vg Hp.n. q J 


4SS # 

q' 


Upqfin.q 2 

+ S S’ S' \u„u p . a .^s .(45 e), 

p- q- ( n p .n. Q n P . a . a J 

f 


ftp.Hw, 3 


Ji - S + »»■ 

n pq ) 


+ SS' \u„ v„ 

p' ( \ Wpg tlpfqflp*, 71 pq fipfq¥i p q J J 
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+ a s' \u„u„. 

<f l \ V^wr 1 ”-.' V n wr' n J>- )) 


+ 8 S' 8 


i /8 j Ww tw( 


&V Sn g .. 8w w , 3 &/y (l >Sffl.,A '| 


(45/), 


|^l +/S k tf * $5*1 + SS' {«„.«*•• 
ft Vi 9 l ».* 4 J pp' 1 WJ 

rjQ/j &&. a 8 8w. a *] , C y j S/i^S/i*. 

+ S/S \u. q u. q ' V—- V + S iV” 

<,q'{ n.fn ^J ( w '/i*. 3 w. a ** n,>„. 

+ ,.„ (v+ «.,^! + „,.^(^ + ^} 

+ss' {u^iv. ( g a ffi-«jv- + 

P ' l \ n 9 *n tq n 9 ,. n p .n. q 2 n p ^ J j 

+ SS'L„uJ^^^ + ^\*?4^)\ .(46 a), 

q ' l \ n p .n^n^ n P *n. q n. q . J J 7 

r 2 '= S {«,*«„. *??*?* + S "-%^ + uj + 

( Wj), V ^•Q^pq 1lp,Vl,q Hpq /) 

+ SS’ j U V .'U P . B y] S y« + u „u v . v (*?y &*«*** + !V^J»‘P1«V 

+ss- j«„.„.. %•;*””-■++^ Aj v-u 

q'* ( V IlfXI* flp, VI*q flpq' /) 

+ 8 S'S' \u m u„. q . f*±L*±&*S: + .(46 6), 

p' q' ( \ ^J). H»q1lp'q* /ip. fl,q fip'Q' /) 

V <?, Bn ^Sn^BtK * &>i pq Bn v . i $n. Q \ 
s = | «" *• \ n„n p . a n^rip.n.a 2 n m n v }ii. v ) 


(hi m hn.f Sn^Stip^Bv.g Bn„,Bv P . Sv.* 


npgUp.^fl.g 


Mpqflp. Il'Q 


. | w u f^ n vq^ l y 2 ^ n v»' ^ &n pq 8n. q * 8n' P ». ^ 8/? y<7 8Mp.&/i. q 8np > > \) 

p* ( W V fipqW'pJ^M’p*' , flpq < ft»q* t ftp‘, < Hpqtlp*1l*q$ip', /) 

+ SS' + (46c), 

q* ( V ^»g' Hjtqfl p „ 71.g' 7ipg /&<p. 7i.g W.q' /J 

n11 a j.. 2 / & n pQ 2 & n P' 2 , &H'v q t bnp, fol'.q ) 

4 ^ | W W ^ n p *n pp n. q nj*n.* /) 


Y o/ ( / &Hpq d/lp* . 8 , O'flpq Wlp'q & lp'. $n,q Sri pg 8'flp'q ) 

p' | W , n>p q W'p'Q , H' V '* u t iqfip' q 7l P '.7l, q /) 


+ 8 8' \u U ( ^* q >8 » ^ l PQ btt'PQ' $ n y » S'tt.q' , ^ n ftq &n pq’ 8n ) 

(j' 1 W W ' llpqnpq'ft, q’ 2 flpq^pq^p.fi,q* ftpqfipq&p* J) 

+ 8 S' S' \u u (topqtop' q'fa*'- 2 + SugqSnp^Snp^Sn^ + &n t)q gnp^>Syi. < y > 8 \| 

p* q* ( W \ npqtop'q'np'* ^pq^p'q^p^fltq* topqflp'q'Ti.q* /) 


.(46 d), 
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Ts—S (-J 








J \ Tipq 5t,g ?ljpg Up, W,g / j 


.ss'-kv --)! 

? ( ** " V *»*«•« V- ;j 


+/S/S' (^4^'*^' + ^4-^)1 

( V W’JJQ U.qU.q* ^PQ ^p* /) 

Sn^Brip. t Sn pq z &n. q] 


t / o j„ 2 (& n ™ w,t v- . v "** 
« “» j Wjxf l— + “" 


n w *n v . 




■» 


+/SS' L^, + y a w ( 

j/ ( \ Upq ^p'o^p* ^pq l ftp'q'R'»q *) 

+ss' \u w u w . + Bn y*y . * n -A \ 

q* | \ Wpo'It,), Upff'^'fl / ) 


+ -S-S 


hi„^hi vV hi v . , Sn^Sn^ 8n„ 


-S'/S' + TO TO 

p' q' ( \ 


.(46 e), 


Wpg 


£*)}.(46/). 


(11) Approximate Expressions for Means to the fourth Order. 

From these expressions (45) and (46), it is evident that we have still to find 
further means, besides those already given. 

I have deduced all of them from the fundamental formulae (34) by the same 
method of transformation as in Article (8), but I will write here only the results 
which I obtained. The mean values of the quantities placed in round brackets 
below each equation are of the same type as those immediately above them. 

(a) 0„. 8 $'V«1 = - n„H v - q (l - 

([8w s . 3 Stt B ..], [&«„. &«„«*]); 


Q>) 


[8h b . 2 8« b ./J — Ujj.MjjV jy + jyg w ». > 

([K. 2 «v. 1 ]); 

[^ W I>.*£ W 1/«£ W I>«'] = n pq n vq’ If ’ 

[Sw^^H'b] = ~ (l ” - 


([8n B . 8 8n WI 8w B .,.] > [8// B . 2 &). B . (I 8n WI .]); 


[8» B . a 8» BJ 8» B ..] 

[8w B . 2 8w M 8w.(,] 

[8n B .*8n B . 9 Sn. a ] 







T. Kondo 


401 


» — 7i p , fip, q 

(j* + ^ Ww ^5 r «- s -«)> 

[5»p.*&i,.,inp..] “Rp.Rp'j^l 

IS? 

K? 

* 15 , 

+ 

« ; N 

1 

i 

(o) [fin,. Sn p .. Sn M Sn,-J =n v< ,n p - 9 ^ 1 

S». Rp-. 3 \ 

y jv + f* * , - v 7* 

(fSnp. Swp.. S-n-p, Sn pV ]); 

- t 

[Sw p . Sn.q 8w w Sfjp.,] = RpjRp-a ^1 

2 H H* 8 _ 

A' N~N* n * 


[S»„. $»*,&w] = - y t»„n P v («„ + + n. e - n.„j, 

[H. 8».«3wp<r$»V«] “ ’*»' (l — ~ ~~jf + » 

[jn^.Sn M *£n v% .] 


1 / 3 \ 

jV" ^0* ■*" ^«r *"" jy^p.WpfJ » 


•»V T «n M 

([^3». **pg # &V.]. [$w,. Sn^Snp^]); 


[ Sw »-- ^ ft jw*Sn p .„] = « w « rt (l - U £ -^ + -| 2 Rp,Rp. 


([S»i„-.s« ot 2 H' 8 ']); 


)• 


(d) [Sw„. «».« &»*•] “ «„•» (« w - -^ 9 - ^* R *« 2 = =* 




N N ~ 2^ n t>- n v'9^' j>'«^ > 

[3t! p . Bn.q ^71,,/j/ 8 ] = w p - # > — iy> npqflp-t' — j^n v .n. t + j^ s «p.R. a Rp. a .^ , 

* [$ w »>. Sn t >--^«*] = “ jr n p« n p’- ( 2s w> + «p. - jf «»»,.), 

(«) [£«*. $«p.« h. 9 8k w ] = HpQ (n p , t - n„.. n„ - i n p . ft,,., - ~ n„..n. Q 

1 _ _ 3 _ _ _ \ 

* - jjj n v « p >. + n p . n p ..n.qj , 

[3»p.8Mp*.§n.. e 8« 1)J .] — fipq- ^Wp- e — n p .n p - q — n p .. n m 

1 3 \ 

N ^ V * % n ' q ^0* ^0'- ft'Qj > 

Q 

(/) $7t #<( 3W|^. ^»a'J * ^IXf^p'tf' 4” Wj,<j Qfhpq* 4“ 2^3 ^P • ^p'. ^l.<j ?t,g' 

(^p'w'^p.^.a 4* ftpg'^p'. 4- n V ' U n$,u tQ ' 4- 

[Snp.S«p..Sn.,*] » 2«p., j^R* - y «».«.„) ( 2 &p, + Bp. - ^ Rp.fi.„), 

[&»,.*Sn^. 8n. # ] = Rp. e R*. y R p . R*<. ^2 R w + R., — fi p< R. # ^ 


•(47). 

« 
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(12) The mean values of the last two terms of n%. 

It is now possible to find the means of {£(«,*$,)}* and S(w w Si)S(w„$ 4 ). 
However, as this involves also long and rather complicated calculations and trans¬ 
formations, I will give here only the results I have obtained and the outline of one 
of the deductions. 

Let us take, for instance, the first term (I*) of (S {u w 8i}) 2 and find its mean. 
From the Equation (45 a) and Formulae (34), (35), (47), we get 

m - ? (fe>‘ M + ? (Ij) m + ??' (d£* 

+ f ?' G&i-nm) * ™ c t>T; ] 

++ K‘ + } 

+ ss ; ^ [K>,».fc,]} 

+ SS'S' {[K,.8n.> J> ,.8«,. # ,]i 
p' 7 ' [n 9 .n P ,.n. q n. Q ' L v 9 p 9 J j 

=3 f {“”•* ~ w P : + f*)} + 3 f {“•»* f«); 

+ ss; {«,.«»-. ((^ - y) - jv t - f))( 

+ 6S |«. _V w *« | 1 ^ 

( " \n„. a n. a IVn B . Nn v .n.„ NV 

\n v .n. ( 

|(«i 


+s 


+ u m 

■vu + 2 m„.m 


1 

IVrL 


_2!-L 

nr- - • 


* Nn p .n. Q IV 2 )j 
■ q) ((n P . ~ (*„ ~ n) + 2 (^1, “ J) )} 


+ 2SS' 

^V<I / 

' 1 

1 \ 

_I (-?<!- + -1 _ 

3 Y\1 

w. * 

k ftp* W>tq 

NnJ 

N \rkp.n. q n p . 

ivjjj 

+ s S' |m„w p - 7 | 

9 f V<7 

( 

1 ' 

\ _ 1 + _1_ 

8\\l 

< *v. 

\n v .n.f 

Nn. Q , 

> N \n p .n. Q n, v 

tf/Jj 

+ 2SS' {«.,<, 
«' l 

(*~'I 

' 1 

1 ) 

_ 1 ( 2 «w + 1 _ 

3 Y>1 

\n. Q ‘ \ 

3p.n. Q 

N fip n J 

N'riv.n.q n. v 

IV//) 

+ SS' ^ 

9 

f _I_ 

\n p ?n. a Nn p ., 

\ l (2n M 1 

) N \n P .n. a fi P . 

-a 


VfJL. 

n w > 

1 | »w 

/ ftpq 

AVIV* 

Nn v .n.q/ 


\Sp.w. # 

1 


1 w g’» 

1 



#fc.,' ?»„•. 

Nn„. 
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As there are double and triple summations, the simplification of the above 
expression is not very easy. It is long and rather complicated, but the result 
I have reached is not so unmanageable and is as follows: 

If we write n^ and without the suffix, for simplicity, as follows, 

n w *=5i and = 

when there is no ambiguity in the meaning of these symbols, then, after simplifi¬ 
cation, the above expression for [T{\ becomes, when we put 

.(48), 

p'» 1 <y'=l \ / 

[7\] - 2IS," - ~ (1 + + wr - Sr’ (1 + $•) 

-»< ,+ MSD + **' s (s) + ( 8 {i})‘ 

+ Wi(£ + £)} +s $ ( * + M 

♦“{’-v* (*:-»)!.:.<«•>■ 

Similarly 

[*V1 - -|s;-i^(i + ^) + ^(i + + 4 s g) 






.(406), 


™ - r.d +*■>*- y 0 + *•> * g) + (« {I})’ + M {!' (g - *)}•••<«•>. 
rrj= 26 V' -1 a' ■- £ a + £ s )+^ a + s g) 

+2s © - + (w) +28 Fv* («-1)} 

+ (? fei) ,+ (?fe})' +88 {»‘ G:i + tJ}. 


[ TJ - is," - £ S,' + (1 + fv - i (1 + (•) + s (I) + 2S (i 


a+ ^ s m-- a+ #., s(i ») +S («) S 


ft/ \.B 


.(49 e). 


[5TJ-“|(1 +1 1 )*- yflV(l + #>-yS g) 

- * <> + **> 8 (I) + *' 8 (1) +28 fe‘ (l + rj. 


27—2 
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And also when 

v P . =^ x , v= P '5* ( Up ~-*) .v(50)« 

[7Y] - 36V' - J6V-|(1 + $*) ^' + ^(1+ <£*)*-i(l + ^*)6 (I*) 

+«i : i fe♦1::)}+ 8 {* +M +8 IS < ^ + M 
+28 {t’ (£♦(£+£-»)} - i - <61 “ ) - 

i m - w - £ s,- - ^ (i+1*> «,■+y, a +$•? - % a+*•> s (I*) 

+*g<*-H +iB {vfe + y}. 

PV] - 3S," -1S,’ -1(1 + p) S,' + i? (1 + pf- y (1 + *■) S (£) 

+ «g(v + ^ + *{tfe + £)}-» a C f V) 

+ 8 {£< v - + M + “f-d 2 * (£ + s.)}.< 61 ”>• 

[ JVj - 2 ,v -1 ,SV - y <1 + *■) S.’ + y-, < 1 +1’)’ - y 0 + **>« (i) 

+,s (»>) +8 { ir * + “->} +8 {I (»’. * i - 1 )}. <M <ft 

W) - w -1+ y, <1 + pf - y (l + P) S,' - y (1 + #■) « (g) 

- » 8 (“ J ¥ !a ) + 28 \t + “->} + 8 {t (£ + y i 

+8 {1 (£+£)}. 

and [5TV] = - y 4' + yi 0 + **)’-y (1 + f) &'- y (1 + P)s g) 

- f s d’) + “ f i a + »-)} +8 k fe + y}. <m />- 

From these equations (49) and (51), it is easy to prove that 
Mean (S {u^Y 

(D + M (£) + X3S ($ - f « (^) 

+48 G&a - 48 (s (i + rj) - 88 (w + y) . 

+ 8 {^ + ( 8 {|( 1 -^)( , _ j »)[)'. m . 
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Mean 8 (u M $j) S (u M S$) 

-*♦»*- > ©♦«©-;* m +s S^ + ^) 

-•• (? (S-♦ ft)) ■- 8 6 < r ’- +r ->) + “ G? (ft; + ft)) 

+ ■ 8 (I (t* ft))- 8 (“s 5 * (ft - + ft)) - s S (ft: + ft)) 


and, consequently, from the Equations (17), (23), (42), (52) and (53), 

W+yS.'- yS (|Vi8s(g) -esg) 


(53), 


■ 6« (“‘ (»,. + . j) - « (ft^J -1S (?£*) 
+ «{^( I+ „-A-i)} + ,gu„-^- 2 r.)} 

- 28 ji < 2 + •»> (ft + ft)} + 28 {s < 2 +2 "> (ft + ft)}' 

=+10S 2 " u8 (a) - 65 (a 


where 

and 


+ es g («,. + - ♦« (ftft (» + »)) + <* I 

+ 11S (| IF„) + tS («! W,,) + 8 {£ (A„ - 2 V,. - 2F..)} 

' 28 {I < 2 + 9 “> (ft + ft)} + M {* <* + 2 “> (ft + ft)} 

" r "-ft( 1 -ft)( 1 -ft) . 


(54), 

(55)*, 


YI. Mean and Standard Deviation of fa*. * 

(18) Now we can find the expressions of the second approximation for the 
mean and standard deviation of <£i*. 

From the Equations (17) and (55), 

. w ’-s{ir„(i-^ + 2|)}.(56). 

• *(?) +s (l)- s .'- s (s( lt *-4:-sq)) =s (s( 1 -i)( , -c;)) 

' s (S( 1 -fe)( 1 -ft)l* S| " w - 
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Therefore 


Mean + /*' - S(u) + S (l - i + 2 |)J. (57), 

and also from the Equations (7) and (64), 

-»'-*'-«(£)- *(£) - «■ (£) - *■ 

- io f (&) +io f (lj) + 1 If (£) + ? (61-X?) 

+ ms $ - as g) - 2 s (“g ( 2 +1)) 4- (S f *„»• 

+12 S g IT W ) + 4S W„) + GS g («,. + «.,)) 

F ->) - 2g (1 * 2 +9 »> fe+£)) 

+ 2S (| <! + 2») .(58), 


where 


n » », 


J>Q> 


u^u n 


’ n pq 2 /( n p. n .o)i 


q~\ 

W'j). = & 

7=1 


. -’S' F p . = f S* , F„ = T hf *), 

F JW = ^fl->') fl-^V A m -SS( u ^- n ^^-) , 


7 = a 

and S stands for the double summation S S . 

j >=1 7=1 

Now if r w , r,,., r mQ are the proportional values of n„., w. a respectively, 
taking the total frequency as unity, then 

u „3« 2 /(*» */r r 

” n,.n. B ViV/A-ATA/ r "l r *- r * 

= a function of proportions and of the same form, 

and, consequently, u P .u, g ; F„., F. a ; and-4 M may also be considered as functions of 

proportions only and of similar forms respectively. 

Let us consider and a again from this point of view, then 


= 4 8 

(?fS) _ 

3/S - 

3.S (—) + 2S ( U *~ 



\n„J 

P \ n„. / 

q \n. Q J \ 

«« / 

= 4/S' 

( u *\ 
\NrJ ~ 


- 3 ?(l£) + 2S ( 

'W w Mp.M.„ 

1 

{ 4 $ / ^£ 22 ! 

\ _ o« ( u f>i 

\ _ 3 $ + 2 /S 



r 

) pUp. 

/ ,VrJ + 8& 

V 


wM r x«)> say. 


or simply 


t * y* 

M* oi " jjj i- 


•(59), 
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where f\ is a function of proportions only and of the form 

fi - 45 (—) - 35 (V) - 35 f “-«*) + 25 ( U ^A .(60). 

For the same reason and by the same method of transformation, the expressions 
for f&iy fi*', ifa* and can be transformed into very simple forms as follows: 

+ (MX 

w'-y/' + y-./- .<«*X 

and, consequently, fa* = <j> a + ^ fai + fa .(63), 

.< 64 >- 

where fa = 5(F'„); TT„ = (l - &) (l - .(66), 

^ = s{F' w (2^-l)} ...(66). 

/x is as in (60) and 

* - 10 {* (£) + f ($} +3 {? (£) + f Ci)} - 48 fe 1 ) - 68 fe‘) 

+IS8 (;$ + + “->}■-«{S (‘ + t)} + W""-)’’ 

+ 125+ 45 + s|^*(^ M - 2F S . - 2F.„)J 

- 28 fe< 2 - 9 “-> + £)} +28 fe (1 +2 “»> (£ + £)} 

.(67). 

(14) The Cose of no Contingency {continued from Article (7)). 

Now let us consider the special case of no contingency again. 

In this case we have the following special relations, besides those of Equations (20) 
obtained on pp. .386—7 : 

n IT Of ( U * Q n VQ \ __ lT _ n% * 

N*' *'~q\ n. Q )~N* V - Q ~ N ' 

% "=!(^) +8 (sr')-y-.<* + ^ 

^--gfl v.V. 

“* 8 < ,r ->= 8 fe ( l - fc) (‘ - fe)} -»<" 1)<x - J) .«■ 
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By these special relations and relations (20), the general expressions for and 


/4a' become 

- \ (* + y) (*-!><*“D- \ (l + y) ®. 

where <» = (* — 1)(\—1) .(70), 


and fit = {2(« —1)(\ — 1) + (* — l) l (X — 1)*} = ^j(2» + «>*) ...(71); 


and, consequently, we have 


(72), 


for the case of no contingency. 


V2ce /- a> to \i 

IT \ N~~2N*) 


(78), 


In the case of no contingency, the formula for /4a', obtained as the first approxi- * 
mation, became identically zero and we could not estimate the standard error of 
the mean square contingency fa*. But now we have got out of this difficulty and 
the formula obtained for this special case is very simple. 

In this case, if the size N of samples becomes very large compared to ©, 

Mean fa*-*>0 as N -► oo , 

i.e. fa*-+<f>* as N x , 

and as #-*-*> .(74). 


This last limiting value of is not of course new * 


(15) Numerical Illustrations . 

Ex. (1). Let us take first the numerical example treated in Article (6), where 
the population frequencies (reduced to the sample size of 100) are as follows: 


TABLE III. 


15 

25 

40 

40 

20 

60 

55 

45 

^■-100 


* Journal of Royal Statistical Society , Vol. lxxxv. pp. 87—94, R. A. Fisher; pp. 95—104,0. U. Yule; 
Biometrika , Yol. xx A . p. 284, J. Neyman and E, S. Pearson. It may also be compared with the value fax 

j2c * 

V S V ~ w ^ en t b® r8 iB no ^dngency and and not is used. See Biometrikch Vol. xi. 

p. 280 (1915). 
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la this case, alter calculation, we have 

8,’-lit 3450, 8 t " - -000 473, S f?) - -012 494, 

£ l «*082498, 

-9 (^) - -OOO 441, 8 g*) - 000 151, 

fifg!(«,. + «.,)}--000472. (l + ^)}--000531, 

S || (2 + 9u) g* + ~j} - -004189, S\W„}^ -009 260, 

8 j| (1 + 2u) ( ^ + ~^») J - -001413, [S (TF W )]* = -000 080, 

8 - -000 087, S = -000128, 

8 {w„(l-± + 2 ^ =>-009 300, 

and 8 1J - 2 V p . - 2 F.„) j = -000 820. 

Substituting these numerical values in the expressions for p t ' and ni directly, 
we get fix = *009 300 and n% = - 003 291. 

Or finding functional values at first, 

• yfrx =* -9260, V r *’“ ’4017, 

and fx - -3066, /, - 2 2486, 

therefore fix' ■» ^ = *009 300, 

+ --003291; 

and finally ^i 2 « <j£ a + /*i' — ‘091 798, 

and *■ V/ij' — /4x' 2 = ‘056 60. 

Ex. (2). Let us take a 3 x 3-table of contingency as the second example, wherein 
population frequencies (reduced to the sample size of 1000) are as given in 
Table IV. 

TABLE IV. 
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In this case, after calculation, we get 

/n'- -004 297, i* = -000 751, 

and £*--154 321, 

therefore ^i a = *158 618, and cr+f = *027 07. 

Ex. (3). Lot us consider a 2 * 2-table where the reduced population frequencies 
are as given in Table V. 

In this case fa, = *486 100, ^ = ‘0049, 

/x» *970158, /a = — *4488, 
and <£ 8 = *494 950, 

therefore ^ — ‘004 851, ^ = *009 657, 

and fa* = *499 801, - *098 15. 

TABLE V. 


45 

5 

60 

10 

40 

50 

55 

45 

jv-100 


Now, with regard to this population, the following experimental sampling 
results were given to me by Prof. K. Pearson. 804 samples of size 100 were drawn 
at random from this population and the distribution of fa 2 was observed. The 
following is the result of this sampling: 


TABLE VI. 



The standard errors of this fa 2 and are as follows : 

the standard error of fa* = *003 4614 *, 
the standard error of cr^s = *002 3125 *. 

* The standard errors of fa* and are given, the latter as its first approximation, by the equations 

s.e s.n.of 

where ti is the number of samples and 9, fa are the values of the theoretical standard deviation and of fa . 
Here n=804, ?= *09815, and the value of fa is given by the distribution law (81), Article (17). 
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These observed values of fa* and cr^t and the theoretical values obtained by 
my formulae for this example are in good agreement having regard to their 

standard errors; this is interesting from a practical point of view, 

* 

Ex. (4). Let us take one more population, in the proportions given in Table 
VII, and find the mean and standard deviation of for samples of 200. 


TABLE VII. 


•0831 

•0786 

•0270 

•1887 

*1032 

•2864 

•0802 

•4758 

*0335 

•1235 

•1785 

•3356 

*2198 

•4885 

•2917 

1-0000 


In this case 


r u = 0786, r 13 = -0270, r tl «1032, 

raa = •0862, r x . = ^ = 1887, r a . = -4758, 

er: $* = S(u„)~ 1 =188 893, 

- 8 ( W' m ) - 3-500 959, fa * - 8 J TT W (2 - l)J * 4-192 990, 

/i = 4 S( ~^) - 3 S (- 3S (^) + 2S ( U *- U -° U A * -801 842, 

V « \ J q V ^.5 / V xi / 


ru- 0831, 

?-** = "2864, 
and so on. 

Further: 


and finally f» = 25 558 896. 

Therefore Mean fa* = *206 503, ff,,* - V004 338 = -065 86 

With regard to the last population, I have made a sampling experiment. In my 
case, the sampling was carried out with the help of Tippett’s Random Numbers*. 

250 samples of size 200 were drawn and the 250 observed values of fa*, thus 
obtained, are recorded in Table VIII. 

The mean value and the standard deviation of fa* obtained in this experiment 
wore 

fa* = -200 72 (S.E. = -0041665)f, 
and ov - -060 86 (S.E. = -003 2328)f, 

which are also in good agreement with the theoretical values obtained by my 
formulae for this example. 


* Tracts for Computers, No. xv. 

f Those two values of the standard error are obtained in the same way as in Example (3). 
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TABLE VIII. 

250 observed values of fa*. 


•370502 

•260 342 

•226946 

•196717 

•175 246 

•149 476 

•108222 

•367 532 

•259954 

•226664 

•196035 

•175094 

*148303 

•107749 

• 36*2669 

•259 520 

•226046 

•195 403 

•173607 

•146151 

•104882 

•343 805 

•259413 

•224 281 

•193 993 

•172 689 

•139878 

. -101628 

•342 499 

•257 989 

•223655 

•193 733 

•172668 

•139776 

•099940 

•333 963 

•257 461 

•222 414 

•193 366 

•171 994 

•138 489 

•098898 

•323 583 

•266603 

•222 398 

•192 621 

•170971 

•138 444 

* *098 408 

•317 275 

•266 069 

•221677 

•192 578 

•170 358 

*137 102 

•080840 

•313 279 

••255 371 

•221 340 

•192 519 

169973 

•135 931 

•063055 

•310627 

•253 547 

*221 209 

•192 471 

•169 758 

•132 744 

•054201 

•309207 

•252 659 

•221149 

•190377 

•169707 

*131 848 


•308 969 

•252 478 

•220887 

•189 337 

•169 369 




•251 270 


•189 269 

•167995 

•130332 


•305 968 

•249042 

•219244 

•188 763 






•218 992 

■188 640 

> -167 031 

•127 415 



•245 817 

•218 488 

•187 250 

•166 349 



•297 917 

•244 628 

•218175 

•186 388 


•126 732 


•297 674 

■ Him 

•218015 

•185 480 

•164 243 

•124 849 


•293363 

M4ETSVM 

•217 070 

•184 789 

•164 239 




•243 238 

•216009 

•184 667 

•162 711 

•124 489 


• 2^1491 

•243118 

•215 974 

•184136 

•162 266 

•123836 


•290145 

•243001 

•215 844 

•183903 

•161 765 

•123 812 


•289985 

•242921 

•213 582 

•183355 

•161 686 

•122 143 


•289051 

•242 213 

•212 336 

■182900 

•161 603 

•120799 


•288 586 

•241 357 

*210517 

■182 841 

•161 602 

•120244 


•288 480 

•240918 

•209 856 

•182 334 

•161 006 

*120097 


•287323 

•239 461 

•209827 

•182 247 

•159840 

•118360 


•284 562 

•239216 

•209 801 

•181 588 

•159 544 

•117135 


•281 469 

•237 815 • 

•208 716 ' 

•180964 

•159 220 

•116869 


•276664 

•237075 

•206 624 

•180957 

•168 757 

•116355 


•276 582 

•234 214 

•205 565 

•180618 

•158391 

•116274 


•275 174 

•234103 

•204131 

•179411 

•156 700 

•115913 


•276082 

•233776 

•202 800 

•177 843 

•156 316 

•115 724 


•272184 

•232690 

•202 653 

•177 842 

•154 973 

•115 238 


•268507 

•232 592 

•202 077 

•177 014 

•154359 

*114 232 


•268 218 

•230464 

•202 019 

•176 487 

•154 290 

•114054 


•267 703 

•229020 

*201 639 

•176 190 

•153763 

•113263 


•266022 

•228 391 

•201141 

•176085 

•153347 

•112 327 


•266009 

•228365 

•199 410 

•175 774 

•151 797 

•112003 


•261 559 

•227 032 

•198342 

•175 522 

•150002 

•110667 



(16) We have now reached three approximate formulae for /**' ( fa ®), e.g. 


Pt = = jif .(A), 

ji(2a + ®*) .(B), 

for the case of no contingency, 

and .(C). 
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Of these formulae, the first two are very simple and very convenient for 
practical application. But the second one is applicable only in the case of no 
contingency, or in cases where at least the mean square contingency Of the population 
is very small; in such cases, the formula (A) loses its accuracy. Therefore in some 
cases the third formula becomes very necessary,.but this formula ts not simple; 
it is too complicated for practical purposes and, from the practical point of view, 
it is desirable to use the simple formulae as much as possible. 

What are the limits to the application of the formulae (A) and (B)? This is 

an important question 'and, although it is difficult to give any general criterion aB 

a guide, the following considerations provide some suggestions on this point. 

* 

Let us consider the four examples in Article (15) again, finding the values of 
fa* and by applying the formulae for the first approximations only, and then 
let ns compare the results with those already obtained. We get the results given 
in Table IX. 


TABLE IX. 


■ 

& 

liauge 

Mean and 
B.D. 

1st Approxi¬ 
mations 

2nd Approxi¬ 
mations 

Ex. (1) 

2 x 2-table 

*082498 

mm 


•091 758 
•054 58 

•091 798 
•05660 

Ex. (2) 

3 x 3-table 

•154321 

from 0 to 2 

?. a 

•157 925 
•02684 

•168 018 
•02707 

Ex. (3) 

2 x 2-table 

■484 060 

from 0 to 1 


*499801 
•098 50 

*499801 

•09815 

Ex. (4) 

3 x 3-table 

*188 893 

from 0 to 2 

w , 

•206 398 
•06085 

•206 603 
•06686 


In these examples, although, except in one case, the values <j>* are rather small 
as compared with the ranges, the values fa 2 and of the first and second approxi¬ 
mations are in good agreement. 

1*1 11 
Now + aud + 

where fa, fa, fi and /* are all functions of proportional frequencies only, mid 
accordingly may be considered as constants when N, the size of repeated samples, 
alone changes its value. 

In this case it is evident that 

Hi’+jffa “id Mi-►y/i. 

as $'-*• oo. . 
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Therefore, unless the mean square contingency of the sampled population be 
zero, i.e. if J\ is not identically zero, we can get good estimates of <f> i* always from 
the simple formula (A) by making the size of the samples large enough. 

For instance, for a given population, if N satisfies the following condition, 

/xxO-lsil/il, 

or N £ \M (= N c , say) ..(75), 

then the second term fijN is numerically less than 10 % of the first term f l9 and 
we can get for many purposes adequate estimates from the first approximation 
formula. 

Let us take once more the four numerical examples above and find the value 
of N 0y and see also how the second term changes its value as compared with the 
first term when N increases. 

The results are given in Table X. 


TABLE X. 



Ex. ( 1 ). 2 x 2-tal>le 
i 2 = -082 498 
Range: 0 1 

/, = • 3066 
/„ = 2-2486 
jV,-73*340 

Ex. ( 2 ). 3 x 3-table 
d 2 = -154 321 
Kango: 0-^2 
f\ « *7332 
/u = 18'19 
iv c -248*l 

Ex. (3). 2 x 2-table 
0 2 --494 950 
Kango: 0 -*-l 
ft ■» ’9702 
/a =‘4488 
JV c -4-626 

Ex. (4). 3 x 3-table 
**=•188893 
Range: 0-^2 
/.-•8018 
/a-25-559 
.A*.-318*8 

N (size of 
sample) 

/ 3 

N 

a 

/ a 

N 

A 

h 

N 

A 

A 

N 

A 

TV- 50 

mm 


•3638 

•7332 

*0090 

•9702 

•6110 

•8018 

100 

•0226 

•3066 

•1819 

•7332 

•0045 

•9702 

*2556 

•8018 

200 

*0112 

•3066 

•0910 

•7332 

•0022 

•9702 

•1278 

•8018 

500 



•0364 

•7332 

•0009 

•9702 

•0511 

•8018 

TV -1000 

111 55$ SB 

•3066 

•0182 

•7332 

*0004 

•9702 

•0266 

•8018 


•0011 

•3066 

•0091 

•7332 

•0002 

•9702 

•0128 

*8018 ' 



= •9260 

^ i = 

= 3*6044 

^,«-4861 

3*5010 


^=*•4017 

^2 = 

= 692*60 


*2 = 4*1930 

Notes 




■ Mill ^1 



= •091 798 


* 1 *- *499 801 


= •206503 


»= *066 60 

<r*,« = 

•027 07 


•09815 


•066 86 


We see from these numerical values that the application of the simple 
formula (A) is not so very limited; if the size N be large and be not very 
small we can get a good estimation of tr+f from the simple formula (AX 
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VII. Distribution of fa*. 

(17) From the definition of the mean square contingency fa*, it is evident fa* 
is always positive. And the closest relation between two characters A and B that 
can be shown in a contingency table consisting of a given number of classes will 
occur when all the individuals having a given A character fall into a single class 
for the B character and vice versa. 

This form of relationship can only occur in a square table, i.e. when 

X = * (« m, say), 

and this closest relationship corresponds to a value of fa* =* m — 1, which is the 
maximum value of fa* that can occur in a sample from any m x m-table*. 

Therefore, we must assume that the mean square contingency fa* can vary 
from 0 to m — 1 according to a uni-modal curve of limited range; thus a Pearson’s 
Type I curve 

y - yo® Pi (fi — #) p * 

may be expected to give a reasonable approximation to the distribution law of fa*. 
Now let us assume that the distribution law of fa* is of the form 


y = yo(fa*) Pl (b-fa*¥t, 

then Range b = fa* (max.) ■» m — 1 ..(76), 

and by well-known theoremsf 

fa*-^ .(77 a). 


. 

and M (total frequency) = y 0 b ir ~ l) . 

where p**pi + l, g=p* + l, and r = p + q. 

From the Equations (77 a) and (77 b), we can easily deduce 
fa*(b-fa*) , __r lt __ 

'- j, - 1 . i-'-r- 

where p% — { 0 >,«) s . 


(77 b), 
(77 c). 


(78). 


* This ossa corresponds to the case where the coefficient 1-i, whioh is the maximum of <7, 

for m x m olasses. 

t If y=y 9 xH (b - x)*’s be the distribntion law of the variate », then 


and 


Mean *=*=■! f\sdx= b l=^±^, 

M (total area] = ( b , 
Jo A 

f!^ ldx=b TTrTiy =fl * « +iJ - 
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Thus if we know M, m, fa* and p*, i.e. we can at once find the values of 

b, p lt pi and y 0 > and we thus obtain the following equations giving the distribution 
law of 0! 2 : 

.:.( 79 ), 

where b = m — 1, 

P* 


Pi 




Pi-3; 


and 


.._ ML i£i ± p* ± 2 > 

y ° b 'p^ +i » r ( Pl +1) r (p 2 +1) 


.(80). 


Let us, for instance, take the population in Table V. In this case evidently 

7)1 » 2 , 

and, as we have afc-eady calculated, 

^ = *499 8005, /i 2 - [a**] 2 = *009 6332. 

Therefore 6=1, and from the Equations (80) 

Pl = 11*471 0186, p 2 - 11-480 9744. 

Accordingly the required distribution law of (f> x 2 in this case becomes 

y = yo(^ a ) u * 4710186 (l - tf>i a ) u,4flW44 .(81)* 

very nearly a symmetrical curve. 


(18) Verification of the Distribution Law of (f> i 2 . 

It is necessary and also interesting to examine the accuracy of the distribution 
law (79) with the help of our experimental results. 

(i) The First Experiment. 

From the population in Table V, 804 samples of size 100 were drawn at 
random, as I mentioned before, and the observed distribution of these 804 (f> i 2 s, 
which were also given me by Prof. K. Pearson, is n 8 \ given in Table XI. 

The theoretical frequencies in the third and sixth columns of this table are 
frequencies in groups, given by Equation (81). 

In this case M = 804, 

and from the Equations (77 c) and (81), we get 

log y 0 = 10-410 6351, 

or 2/0 = 2-574157 x 10 10 . 

Now we can find the mid-ordinate y 8 of any group from the Equation (81). 

* If the frequency function (b - x)*" 1 , theu fa for this distribution is given by 

3 (r+ 1 ) {2r*+p ? (r- 0)} 

M (r+ 2 )(r+ 8 ) ~ ’ 

and for the distribution law (81), we have &=2-785 35, which was needed to find the standard error of 
the standard deviation 04 ,* in Artiole (15). 
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The theoretical group frequency n 8 in the table was calculated from these 
mid-ordinates, applying the formula 

n '• ** gfQQ {^1^8 y 8 4- 208 (y,_! + y, +1 ) —17 (y#-a 4- y#+a)|> 
where h is the class interval. 

We can now compare the two sets of frequencies by the test for goodness of fit. 


TABLE XL 



In each group the frequencies are in good agreement, except in a few groups 
where the irregularity in the observation arises from the fact that, in sampling, we 
must have whole numbers only in the cells*. 

Grouping together as indicated by brackets to avoid this irregularity,* we get 

X a = 7*329 931, 

and consequently P = *770 775. 

From this value of P we can say that the distribution of <f> i a , obtained by the 
above method, is well satisfied by this experimental result. 

* It must be remembered that fa* is not really a continuous variate for tables with a limited number 
of (sells; whole numbers only in the cells can be provided by the samples, and theBe lead to a discontinuity 
in fa*, the more noticeable the smaller the number of cells. 

Biometrika xxi 


28 
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Scaie of <J> ( 1 m Sfcflipks 
Fig. 1. 



T. Kondo 


419 


(ii) The Second Experiment. 

In this experiment, mentioned in Article (15), for the population in Table VII, 
I reached the distribution of fa* given in the first column in Table XII. 

Now the following theoretical values have been already obtained: 

£,* = •206503, = -065 863, 


and further m = 3, M = 250. 



From these numerical values and the distribution (79), we got the following 
equation for the theoretical frequency of fa* for this case, 

where p, = 7712 036, p* = 7*664 4051 .(82). 

and y„ = 7-648 80 xlO 14 J 


28-2 
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The frequencies n H in the third column of Table XII aro the theoretical values 
obtained from equation (82), and n, in the same table gives the corresponding 
observed frequencies. 

We can now compare the theoretical frequencies with the observed ones. 

For the same reason as before we group together as indicated by brackets, and 
testing for goodness of fit, we have 

tf-n-9181, 

and consequently i > = ‘tS69 245. 

With this value of P> we can say that the distribution (79) of is satisfied 
again by another experimental result. 


TABLE XII. 


(central 

values) 

«• 

(obs. freqs.) 

(theor. freqs.) 

0*45 


' 

•228 

*43 



•418 

•41 



•743 

*39 



1-285 

•37 

3 


2*148 

*35 

2 

l 

3*473 

•33 

2 


5*415 

*31 

8 


8*123 

*29 

14 

11 -670 

•27 

12 

10*003 

•25 

25 

20*820 

•23 

27 

25-530 

•21 

25 

29*250 

•19 

33 

30*958 

•17 

35 

29*810 

•1ft 

17 

25-575 

13 

23 

18-975 

•11 

18 

r 

11 *65 ft 

•09 

4 


5-400 

•07 

1 


•735 

•05 

1 


•263 

•03 



•005 


(19) Now, in practice, what we really have are samples, and the population, 
from which these samples are drawn, is usually unknown. Therefore, it is important 
to examine what we can find out about a population by the above theory from the 
individual samples which may be drawn from the population. 

We have two sets of samples. One set of 804 samples is drawn at random from 
the population given in Table V, and the other set of 250 samples is drawn at 
random from the population given in Table VIL 





T. Kondo 


421 


Suppose we take from each set ten representative samples; this I have done 
by taking the two theoretical distribution curves of fa* (81) and (82), breaking 
each of them up into ten strips of almost equal area and choosing from each set of 
samples, ten samples with a fa* as near the central value of each of these strips as 
possible*. 

The fa*’a of these representative samples are given in the first and the fifth 
columns of Table XIII. Substituting the cell and marginal frequencies of these 
20 samples in my formulae for pi and pj instead of the population frequencies, 
I obtained the values of E, the estimate of fa* from sample, and of a $,* given in the 
same table. 

TABLE XIII. 


The 2 x 2-table 

The 3 x 3-table 

True 0 , 3 *494 950, Theoretical <r$,« = *09815 

True 0j2=£ a = *188 893, Theoretical <r^,*= *065 86 

ft 2 

(sample) 

E 

(estimate 
from sample) 

(from sample) 

\ = 

(E-&\ 

\ I 

(sample) 

E 

(estimate 
from sample) 

<M>i s 

(from sample) 

\ ) 

■725 550 
•GOO 12G 
•560022 
•5400(H) 
•510592 
•481 668 
•460000 
•4400(H) 
•401 296 
•274 350 

•723 566 
•696334 
•556072 
•5363(H) 
*515 070 
•477 287 
•455038 
•431 731 
•395 269 
•267 254 

•079 29 
*097 63 
•09515 
•091 96 
*098 19 

093 81 
*097 72 
•097 70 
■098 72 
*08819 

2-883 
1-038 
•642 
•450 
•205 
- -188 

- -408 

- *647 
-1*010 
-2*582 

•370 502 
*281 469 
•249042 
•230464 
•206 624 
•195 403 
•184136 
•172 689 
•150002 
•080 840 

•355666 
•265016 
•231 579 
•213 270 
•190012 
•176666 
•165 823 
•154 755 
•132181 
•061 738 

•064 70 
•079 31 
•072 79 
•072 93 
*060 86 
•056 34 
•05411 
•058 76 
•04698 
•03842 

2*578 

•960 

•586 

•334 

*018 

- -217 

- *426 

- -581 
-1 *207 
-3*310 


From these values of E and a^ we see that, if the sample deviation in fa* 
is not large, we can obtain good estimates of the true populations, but the 
estimates become worse as the sample fa* diverges from the true value $ 2 as we 
should expect. 

Now if we find and /a 8 ' from my formulae by introducing sample values and 
estimate the true fa*, i.e. $* and from them, then we might practically use the 
rough rule that the population fa* “almost certainly ” lies within the range 
r /sample estimate\ - / calculated \ 

" \ of true fa* ) “ Vfrom samples* ’ 

for we have only one exception in twenty random samples, or 95 % lie within the 
range, and indeed “most probably” lie within 

E ± 2 

since 80 % within this range. 

* In the tail strips, the medians are used instead of their centres, as there are no samples very near 
the centres. 
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For this reason I have found the ratio 



and the results I obtained are given in the fourth and eighth columns of Table XIII, 
which throw some light on how far this rule would be justified by showing the 
result of application of this rule to a typical series of samples. 

Now if the distribution law of a variate x be known its modal value will be 
the single value which will most probably occur in a single sample, and if the 
distribution law is of the form 

and ir, fi 2 are the mean of x and the second moment coefficient about the mean, 
then the probable value of x —the modal value of x —is given by 

The distribution law of <f>^ is of the above form in my theory and as the probable 
value of (t>i 2 (^i 2 , say) is one of the most important estimates of 1 have found 
<J>, 2 from the representative samples and from the population. 

The results obtained are given in Table XIV, where <t>i* is the estimate of 3>! 2 
from samples and the values of the ratio (4> x ' 2 — ^i 2 )/^ 2 show that we can get 
adequate estimates of 4> x 2 from samples, except in the case of a few samples from 
extreme strips. 

TABLE XIV. 


The 2 x 2-table 

The 3 x 3-table 

Theoretical probable value 4> J 2 = *499 784 

Theoretical probable value $j s =*187238 

0T* 

(sample) 

cJjj'2 

(from Bample) 

V 

(sample) 

v 

(from sample) 

v-v 

+i* 

•725 556 
•60(3126 
•560022 
*540000 
•519 592 
•481 668 
*460000 
*440000 
•401 296 
•274 350 

•714 293 
•604 991 
•560 694 
•539049 
•516 386 
•475 495 
•480486 
•425 716 
•385 539 
•246 266 

*429 

•211 

•122 

•079 

*033 

-•049 

-•039 

-•148 

-•229 

-•507 

•370502 
•281 469 
•249042 
•230464 
•206624 
•195 403 
•184136 
*172 689 
•150002 
•080 840 

•346240 
•244044 
' *210893 
•190350 
•171983 
•169945 
•149 283 
•133 545 
*116 239 
•022321 

•849 

•303 

•126 

:017 

-•081 

-•146 

-•203 

-•287 

-•379 

-•881 


Moreover, if we know the distribution law of $j a , we should expect more 
samples in random sampling about the probable value 4> x a than about the mean 
*i a or the population value <£ a . 
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To examine this point I counted the numbers of samples found in the two sets 
of observed samples which lie within the ranges 

$1® ± i<>V, ft ± k<r<h* and ft 2 v**, 
where a*,* is the theoretical standard deviation already found, i.e. 

= '065 86 for the set of 250 samples, 
and ( 7 $* = '09815 for the set of 804 samples; 

and obtained the following results as might be expected: 



Number of samples 
within 

Number of samples 
within 

N umber of samples 
_ within 

The set of 250 samples 

55 

54 

46 

The sot of 804 samples 

68 

60 

68 


In the second case 

4>!* =-499 784 and *499 801, 

and the difference = 000 017. 

Therefore 4>i* and ft 2 arc almost the same and this accounts for my obtaining 
the same number of samples about 4>i 2 and ft 1 in this case. 

If they had not been almost the same we should probably have found more 
samples about than about ft 2 as in the first case. 

[ Remarks.] (i) Mean <f>i 2 in samples = pop. <f> 2 + fix'. 

Therefore in the long run the sample fa 2 will be larger than the population <£i a 
by a quantity ni and the estimate of population </>i 2 , obtained from the sample, 
which is denoted by E in Table XIII, would be given by 

E = sample <f> 2 — fix'. 

(ii) The last sample in Table XIII, drawn from the 3 x 3-table, illustrates a 
difficulty which may always arise in a problem of this kind. The observed fa 2 in 
this sample is ‘080 840; for a 3 x 3-table with no contingency we have, neglecting 
terms in 1/N 2 , 

Mean fa 2 in sample =-= '02, 



Hence we should conclude that the sample could not have come from such 
a population. If we were, however, to form the possible upper limit of in the 
population and to say that it could not lie above 

E + 30-^j* (obtained from sample) = 176 998, 
we should in fact be in error as the population value of fa 2 actually = 188 893. 
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The erroneous conclusion arises in this case, and might well arise in other 
samples from this population, because the standard deviation of <f> x % calculated 
from the sample changes very considerably with changes in the sample frequencies. 
This is partly due to the fact that the <f>i 2 in the population lies relatively close 
to zero. 

When we come to the other samples where the population <f> x 2 is larger, although 
the size of sample be less, the cr^a's calculated from our ten representative samples 
are much better estimates of the true cr^a’s. 

Our estimates will improve (a) as the size of the samples increases and ( b) as 
<f>i 2 increases. 

(iii) In the calculations of Tables XIII and XIV, I have used for practical 
reasons the formulae of the first approximation only. If I had used the formulae 
of the second approximation, I should have obtained better values for all my 
estimates. 

Lastly it must be noted that the calculation of cr+t from my formulae (even those 
of the first approximation), using either the population or the sample frequencies, 
is a somewhat lengthy business. For a sample from a 2 x 2-table and by the 
formulae of the first approximation, I have taken about 45 minutes, and for one 
from a 3 x 3-table, a little more than 1| hours. 

But if problems occur when it is of real importance to know the reliability of 
<J> 3 2 , the result of the labour may be well worth its expenditure, especially if the 
samples be large. 


VIII. Mean and Standard Deviation of the Coe fficient of Mean Square Contingency . 
(20) The coefficient of mean square contingency C 2 is defined by 

tWfrva+fc*).(83). 

If we write, for simplicity, as follows: 


/ = nrf 2 and ewew+n 


and let 8(7 2 , 8<f> i 2 be the deviations of C 2} <f>x 2 respectively from their population 
values 0 2y </> 2 , then 

( 'i = \A - i + £a f ~ 1+i 2 


= 1 + 2 (- 1)« 


Pmf* 


—i- (i+/*fr*r . 

where p m is the coefficient of the (m + l)th term of (1 +#)*, 

(1 +/ty 1 *)-» = 1 - m/Stf + m (m + 1} / 2 W)*. 


.(84), 


and 


(85). 
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The infinite aeries (84) ia convergent, if 11 + > 1. For 0, this 

is true if (o) 8</>i 2 £ 0; or (6) 8^i* < 0, provided | 8<fo* | < <£ 2 . For = 0, 8<^i* is 
always £ 0. The infinite series (85) is convergent., provided 

\f I < 1» i- e - | 8tyi 2 | < 1 + $>*• 

Therefore if 1ftyi* | < <£ 2 or 0 < $<f> j* < 1 when <f>* = 0, then both infinite series (84) 
and (85) are convergent. But these conditions, in the more usual cases, may be 
assumed quite reasonably. 

Now from the equations (84) and (85), 

C t - 1 +i(- 1 rp*f m j 1 - •»/(**I 2 ) + — g --/ 2 W) 2 - •••}, 

and O, = Vl - — 1 - 2 = 1 + I (- l)W m - 

Y 1 + 9 w*=i 

Therefore S(7 a = C a - (7 a 

- I (- l) m+1 mp m / |m+1) (tyS) 

m~ 1 

+ 5 (-1 r ” - ( y a —W) 2 +... 

Wt — 1 * 


4 1 (S^ + i a (W 2 +.(80), 

where A?j = £ (— l) m+1 mp m / r{M,+1) l 

m = l 

«d h- I • 

m= l “ 

Therefore, we have 

S0 2 = ki (8^*) + k a (8<£i 2 ) 2 (approximately).(87), 

and thus (C t )=Mean 80 2 

= k\fA\ (<£i 2 ) + ^ 2 / 12 ' (</>i 2 ) (approximately).(88 a), 

/**'(0*) = Mean (80,)* 

= &i a /* 2 ' (<£i 2 ) (approximately) .(88 b). 
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Similarly 


k t = — 


1 + 3(7** 


SC» a (1+ <#>*)* 


.(91). 


and 


From (88), (90) and (91), 

Mean G t = 0% + - ^ (approximately) ...(92), 

2 8 2(7 a (l+<^)* 8C a * (1 + <f>*f 

in rough approximation .(93). 


<r r ,= 


<r«,i 


2<\(l + ft) 3 

But we have already found that 

therefore we can find the theoretical values of C 2 and <r Ci approximately, and the 
following results are obtained: 

Gz = G 2 + ^ (^i y l r i + ^2 fi) + ^ra (^i ^2 + kzfz), 


and <t Ci = £1 V'/x 2 ' - (/*i') a = k x * .(94). 

For instance, let us consider again my experiment of sampling from the 
population in Table VII, Article (15). 

In this case <f> 2 = *188 893, and 

fMi (fa*) = *017 CIO, fiz (</>i 2 ) = *004 648, 

as already found. 

Therefore from the equations (90) and (91), 

k x *= -887 459, l\ * - 1*734 399, 
and finally from (92) and (93) we get 

C 2 - *406 1G6, and <r Ci = *058 451. 

Now for this population we can get 250 observed values of Cz from the 250 
values of fa 2 , obtained in my experiment. 

I calculated these values of C 2 and found the observed values of G% and cr Ca . 

The following results were obtained: 

Mean C 2 = *4028 (its s.E. = *003 697 )f, 
and <r Ci = *0528 (its s.E. = *002 536)f, 

while their theoretical values, given by my formulae, are 

O 2 = *406166, cr Cl = *058 451, 
and therefore we can say that they are in fair agreement. 


* Thus o c =to a first approximation, the value given by Blakeman and 
a 2C 8 (l + ^) a (1 + ^)1 

Pearson, Biometrika , Vol. v. p. 191, eqn. (xxxv). 

t These standard errors are obtained in the same manner as indicated in the footnote to Article (15), 
p. 410. 
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(21) Finally let us consider the distribution of Ci. 

It is well known that C 2 is always positive and ranges from 0 to 



where m is the number of categories, the latter value resulting when there is as close 
a relation as is possible between the variates. 

Therefore let us assume in the same way as in Article (17) that a Pearson’s 
Type I curve will give us approximately the theoretical distribution of also by 
the same method of deduction as on p. 415 we get the following equation as 
the frequency distribution of C 2 : 

y^yoCfi (b - C 2 )pi, 


where 


and 


v -' * b ( *„• 


VI 


- 1 , 


0*(6-G g ) 

Ma _ MT(pi + pg + 2) 

y o 


. (95). 


'b'vi+r'** 1 ' r(pi + i)r(p 2 + i) 

For instance, if we consider the distribution of the 250 values of 0 2 , given in 
Table VIII on p. 412, 

C 2 — *4002, a c% = *0585, m = 3, and M = 250, 
and from the equation (95) we get the following equation as the frequency distri¬ 
bution of C 2 for this case: 

y « y 0 Cf*™*(-H\Q5 - C 2 )*™ 

where y Q _ 4 33150 x 10 5 * 0 .(96). 


TABLE XV. 


C, 

(central 

values) 

Observed 
frequencies 
of <? 2 

Theoretical 
frequencies 
of e a 

C 2 

(central 

values) 

Observed 
frequencies 
of C 2 

Theoretical 
frequencies 
of C 2 

*17 


f 

■003 

•41 

28 j 


33*370 

•19 



•019 

•43 

36 ‘ 

t 

30*930 

•21 



•087 





‘23 

1 


*313 





•25 

1 


•925 

•45 

31 J 

f 

26*682 

•27 

1 


2*304 

•47 

20 

{ 

19*054 

*29 

1 


4*928 





•31 

10 

\ 

9-166 



- 






•49 

9 


12*678 





•51 

6 


7*342 

•33 

25 


14-984 

•53 



3*768 

•35 

10 1 

t 

21-607 

•65 



1*668 





‘ *57 



•633 





•59 



*202 

•37 

20 1 

r 

27*987 

•61 



*063 

•39 

42 j 

i 

32*282 

1 




*012 


The theoretical frequencies in the third column of Table XV are provided 
by the above equation. 
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I examined the distribution of the 250 observed values of C* and the results 
obtained are given in the second and fifth columns of the same table and in the 
diagram below, Fig. 3. 



Fig. 3. 

Grouping the frequencies together as indicated by brackets in Table XV for 
the same reason as in Article (18) (i), we obtain by applying the ^ 2 -test, 

= 8*4764, 

and consequently P = *133 763. 

This value of P is not so good as in the case of fa 2 discussed above, but it is 
reasonable, and we can say that the distribution law (95) for the coefficient of 
mean square contingency G 3 is not contradicted by this experimental result. 











MISCELLANEA. 

Note on Variability in Girls and Boys (Glasgow) for Height and Weight. 

By ETHEL M. ELDERTON. 

In a memoir published in Vol. x. (pp. 288—339) of this Journal, I dealt with tho regression 
curves of weight and height on ago of a very large number of Glasgow children. Those children were 
grouped in four classes of schools, A, B, 0, D, according to the economic conditions of the district. 
In that papor although they wore necessarily computed I did not publish the standard deviations 
for each age-group, as 1 was seeking only the change in height and weight with age. My num¬ 
bers were so largo that it was possible to obtain standard deviations with a relatively small 
probable error. I have since found several investigators inquiring for reliable measures of the 
variability in stature and weight of boys and girls. Accordingly I give here two Tables. The 
first provides the Means, Standard Deviations, and Coefficients of Variation of the ontire popu¬ 
lation of children not divided into the groups A, B, C and D. Tho second gives the Means, 
Standard Deviations, and Regression Coefficients of Weight on Height for school ages of the four 
grades. Tho ages are central ages. 


TABLE I. 

Means, Standard Deviations , and Coefficients of Variation of Glasgow Girls 
and Boys at various ages for all Schools combined. 

Girls 


No. Of C&B68 
and age 

Means 

Standard Deviations 

Coefficients of Variation 

Height 

Weight 

Height 

Weight 

Height 

Weight 

5(894) 

6 (3104) 

7(3828) 

8(3928) 

9(3819) 

10 (3762) 

11 (3618) 

12 (3658) 

13 (3225) 

14 (1229) 

40*188 

41*727 

43*538 

45*375 

47*315 

49*163 

50*973 

53*019 

55*211 

57*261 

38*129 

40*570 

43*921 

47*519 

51*792 

55*968 

61*151 

67*114 

74*778 

82*757 


4*2959 

4*6896 

5*1844 

5*7708 

6*5072 

7*3037 

8*2768 

10*1768 

12*0868 

13*6973 

6*01 ± *10 

6 *10 ±*05 

5 *97 ±*05 
5*88 ±*04 
5*91 + *05 

6 *82 ±*05 

5 *85 ±*05 
5*88 ±*05 
6*99 ±*05 
5*43 ±*07 

11 *27 ±*18 

11 *56 ±*10 

11 *80 ±*09 
12*15±*09 

12 *56 ±10 
13*05 ±*10 
13*54 ±*11 
15*16±*12 
16*16±*14 
16*55 ±*23 


Boys 




38*829 

2-4349 

4*3547 

6*07 ±*09 

11*22±*17 

6 (3322) 

41*937 

41-775 

2*4364 

4*7906 

5*81 ±*05 

11 *47 ±*10 

7 (3903) 

43-726 

45*274 

2*6294 

5*3286 

6*01 ±*05 

11 *77 ±*09 


45*784 

49*275 

2*8832 

5*8846 

6 *30 ±*05 

U*94± *09 

9(4017) 

47*680 

53*696 

2*8154 

6*4161 

5 *90 ±*04 

11 *95 ± *09 


49-626 

58*371 

2*8226 

7*0398 

5 *70 ±*04 

12 *06 ±*09 

11 (3761) 


63*020 

2*8504 

7*7292 

5 *57 ±*04 

12*26±*10 

12 (3632) 


68*199 

2*9412 

8*7298 

5*56 ±*04 

12*80±*10 

13 (3638) 


73*569 

3*2333 

10*2444 

5 *93 ±*06 

13*92±*11 

14 (1467) 


79*180 

; 

3*3144 

12*3372 

6 *89 ±*07 

15 *58 ±*20 
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TABLE II. Means, Standard Deviations , and Regression Coefficients of Weight on 
Height of Glasgow Girls and Boys at various ages in four grades of Schools. 

Girls 



Schools A 

Schools B 

Age 

Mean 

Standard Deviations 


Mean 

Standard Deviations 


Height Weight 

Height Weight 

i 

Height Weight 

Height Weight 


Regression 


11 - 835*8 

12- 4440 


4203$ 

43-743 

40*556 

43-934 

45-601 

47*721 

47-352 
49*162 
51054 
53017 
55’243 
57062 

51-844 

55-771 

60*839 

66*779 

74-316 

81-320 


10-0822 

11- 8143 

12 - 6811 


41- 283 

42- 951 
45066 
47037 
48-815 
50-598 

52- 284 

53- 849 
55186 

40-884 

44178 

47*993 

52-253 

56-716 

61-567 

66-401 

71-680 

75-593 

2-4988 

2-6881 

2-9152 

2-9094 

2-9170 

2-8865 

2-8980 

2-9911 

31839 





9-6112 

11-1960 


49-542 58-384 

51-072 62G74 


9-8383 

10*7314 



Age Mean Standard Deviations Mean 

-Regression - 

Height Weight Height Weight Height Weight 


4- 4488 

5- 2686 

5- 6187 

6- 4095 
7*1487 

7- 8814 
9-9720 

12-0201 

12-7213 


Schools D 


Standard Deviations 


41*943 41-326 


47*636 52-694 

49-411 56-907 

61 190 61-933 

53*261 68-353 


57-032 83 



Height Weight 


Regression 



6 42125 42*532 

7 43-968 45-900 

8 46*223 50-076 

9 48-060 54-383 



43031 

44-809 

40-893 

49009 

60-850 

43*301 

46-642 

51-161 

56-264 

81-163 

52-606 

64*244 

66- 974 

67- 710 

66*268 

70-848 

76-886 

83*214 





4’6533 

4-9927 

5*4587 

6- 3646 
0-8466 

7- 7596 
8*6799 

10-5429 

11*3919 
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Note on a Paper publiehed In Biometrika, VoL xix. 

By J. O. IRWIN, M.A., M.Sc. 

In my paper, Biometrika , Vol. xxx. p. 225, “ Oil tbo Frequency Distribution of the Means of 
Samples from a Population having any Law of Frequency with Finite Moments, with special 
reference to Pearson’* Type II,” I hod occasion to evaluate the integral 


, v If sin n 0 _ AJA 
?(#)«- / — 2 jr- cos 2x6d6. 

it J “ 


The result obtained was correct, but there is an error in tho demonstration which requires 
pointing out. I obtained the identities 


n oven = 2 r. w (x) = 


s (-D' 

2 2r ir 


7:.E<- 


1 )• ' lt C 9 {cos 2 (r - s+&) 6 4 - cos 2 (r - s - x) 6} 


n odd «* 2 /* 


+(-I)^C r oo9 2^]~ .(1), 

-1. • (*) = 2^-4 [ X ( - x y -' r ~' C > v sin [2 (r- 4+*) - 1 ] 6 . 

in [2 (r-s-x) . (1 )bi*. 


+ H111 


I then asserted that we had to evaluato a number of integrals of the types 

sin { 2 k- 1)0 


and by integrating each of these by parts I obtained tho results 


d 6 . 


■•(*). 


and 


/■” sin( 2 i- 

J — " 


—jji— M= ± ( 2 r-l) ' l — aocordm 8 48 5 0 , 


-DO 


dd=±''~ 


(- l) r_1 (2£— l) ar-a 


- according as 2 i? -1 5 0 . 




( 2 r — 2 )! 

Now owing to a pole at 0=*O, each of the integrals ( 2 ) is divergent and tho results (3) are incorrect, 
the true results being in fact infinite. 


The proof may be made rigorous as follows: 


The integral 


w(j?) = 21im€-^0 




cos 2x6 d$. 


Now the integrand of w{x) contains no pole at 0** 0 , therefore its Laurent expansion in positive 
and negative powers of e (if it has one) will be such that the coefficients of the negative powers 
of e all vanish. 


By making use of the relations 


/: 


0 Sr p=o (Sr—1) 


(2£)pco 8 (2*<+-g ) ( — l) r (2t)^ f * siu 2 M , 

(2r — 1)! j, 6 m ' 


(2 r- 


( 21 k -1 y sin (( 2 /fc— 1 )*+^) 


«nM-1) 6 M _ V v V v ' U + 

J. e 1 '- 1 p to(2r-2) ... 


( — l) r ~* (2k— l) 2r ~ 2 


(2r-2) I 


we can obtain 


j“ f ( 8 ) dff and j" 4 ,( 8 ) d 8 , 


pi" (*^11*44 

.(4), 


where/and <p are the integrands of ( 1 ) and ( 1 ) bis. 
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By making use of the expressions 

n even, 

and — j <t>(3) d3, n odd, 

and expanding all tho sines and cosines and the integrals in powers of r, we obtain the Laurent 
expansion for w t (x). As we have seen, the coefficients of all the negative powers of t must 
vanish ; further no contribution is made to the term independent of « by the series in (4) 
(because each series consists of cosines divided by odd powers of c and sines divided by even 
powers of *). Thus these series only contribute to the positive powers of « in the Laurent expan¬ 
sion, and these vanish in the limit. Thus only the integrals on the right-hand side of (4) need be 
taken into account in evaluating 

w (x) * 2 lim.^o (#), 

and these terms wore the ones taken into account in my i>aper. Hence tho final result given there 
is correct. 


The following misprints in the j>ai>or may be pointed out: 
p. 234, line 12 for £(r) r2r C r read £( — l) r2r (7 r ; line 15 should read 

sin ' lr 3 cos 2x3 — 2 {£ (-1) - 2r 1 C t [cos 2 (r-s+x) 3+ cos 2 (r-s+x) 3 ] \ + * C r oos 2x3. 

p. 235, lino 6, for ~ read . 

p. 236, line 5, for sin[2(r—*+# — !)-1]0 read sin[2(r — «-f 
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